


(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 


(19) World Intellectual Property Organization 

International Bureau 




(43) International Publication Date 
15 November 2001 (15.11.2001) 



PCT 



(10) International Publication Number 

WO 01/85946 A2 



(51) International Patent Classification 7: C12N 15/29,
15/82, C07K 14/415, 16/16, G01N 33/68, C12Q 1/68,
A01H 5/00

(21) International Application Number: PCT/IB01/01307 

(22) International Filing Date: 14 May 2001 (14.05.2001) 



(25) Filing Language: English

(26) Publication Language: English

(30) Priority Data:
60/204,045 12 May 2000 (12.05.2000) US



(71) Applicant (for all designated States except US): 
CROPDESIGN N.V. [BE/BE]; Technologiepark 3, 
B-9052 Zwijnaarde (BE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): INZE, Dirk 
[BE/BE]; Driesstraat 18, B-9310 Aalst (BE). BOUDOLF, 
Veronique [BE/BE]; Koningin Fabiolalaan 70, B-9000 
Gent (BE). DE VEYLDER, Lieven [BE/BE]; Josef 
Boddaertstraat 23, B-9031 Gent (BE). ACOSTA, Juan, 
Antonio, Torres [MX/BE]; Victoriastraat 60, B-9000 
Gent (BE). MAGYAR, Zoltan [HU/BE]; Baliestraat 81, 
B-9000 Gent (BE). 



(74) Agent: DE CLERCQ, Ann; De Clercq, Brants & Partners,
E. Gevaertdreef 10 a, B-9830 Sint-Martens-Latem
(BE).

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH,
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, 
SL, TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, 
ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guidance
Notes on Codes and Abbreviations" appearing at the beginning
of each regular issue of the PCT Gazette.



(54) Title: NUCLEIC ACID MOLECULES ENCODING PLANT CELL CYCLE PROTEINS AND USES THEREFOR
(57) Abstract: The invention provides isolated nucleic acid molecules, designated CCP nucleic acid molecules, which encode novel
cell cycle associated polypeptides. The invention also provides antisense nucleic acid molecules, recombinant expression vectors 
containing CCP nucleic acid molecules, host cells into which the expression vectors have been introduced, and transgenic plants 
in which a CCP gene has been introduced or disrupted. The invention still further provides isolated CCP proteins, fusion proteins, 
antigenic peptides and anti-CCP antibodies. Agricultural, diagnostic, screening, and therapeutic methods utilizing compositions of 
the invention are also provided. 



NUCLEIC ACID MOLECULES ENCODING PLANT CELL CYCLE PROTEINS 
AND USES THEREFOR

Related Applications 

This application claims priority to U.S. provisional patent application serial number 
60/204,045, filed May 12, 2000. The contents of this provisional patent application are 
incorporated herein by reference in their entirety. 


Background of the Invention 

Cell division plays a crucial role during all phases of plant development. The
continuation of organogenesis and growth responses to a changing environment require
precise spatial, temporal, and developmental regulation of cell division.

The basic mechanisms controlling the progression through the cell cycle appear to
be conserved in all higher eukaryotes, although the temporal and spatial control of cell
division can differ greatly between organisms. Plants have unique developmental features
which are not found in either animals or fungi. First, due to the presence of a rigid cell
wall, plant cells cannot move and consequently organogenesis is dependent on cell division
and cell expansion at the site of formation of new organs. Secondly, cell divisions are
confined to specialized regions, called meristems. These meristems continuously produce
new cells which, as they move away from the meristem, become differentiated. The
meristem identity itself can change from a vegetative to a reproductive phase, resulting in
the formation of flowers. Thirdly, plant development is largely post-embryonic. During
embryogenesis, the main developmental event is the establishment of the root-shoot axis.
Most plant growth occurs after germination, by iterative development at the meristems.
Lastly, as a consequence of the sessile life of plants, development and cell division are, to a
large extent, influenced by environmental factors such as light, gravity, wounding,
nutrients, and stress conditions. All these features are reflected in a plant-specific
regulation of the factors controlling cell division.

The unparalleled potential of plants for continuous organogenesis and plastic
growth also relies on the competent or active state of the cell division apparatus. The
discovery of a common mechanism underlying the regulation of the cell cycle in yeasts
and animals has led to efforts to extend these findings to the plant kingdom and is leading
to research aimed at converting the gathered knowledge into useful traits introduced in
transgenic plants.

When eukaryotic cells and, thus, also plant cells divide, they go through a highly
ordered sequence of events collectively termed the "cell cycle." Briefly, DNA
replication or synthesis (S) and mitotic segregation of the chromosomes (M) occur with
intervening gap phases (G1 and G2), and the phases follow the sequence G1-S-G2-M. Cell
division is completed after cytokinesis, the last step of the M-phase. Cells that have exited
the cell cycle and have become quiescent are said to be in the G0 phase. Cells at the G0
stage can be stimulated to reenter the cell cycle at the G1 phase. The transitions between
the different phases of the cell cycle are basically driven by the sequential
activation/inactivation of a kinase (called "cyclin-dependent kinase", "CDC" or "CDK")
by different agonists.
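
By way of illustration only, the phase order described above (G1-S-G2-M, with re-entry from G0 at G1) may be modelled as a small transition table. The names Phase, ALLOWED and is_valid_progression are hypothetical and are not part of this disclosure. The exit from G1 into G0 is an assumption of the sketch, since the text does not state from which phase a cell becomes quiescent.

```python
from enum import Enum


class Phase(Enum):
    G0 = "G0"  # quiescent; outside the cycle
    G1 = "G1"
    S = "S"
    G2 = "G2"
    M = "M"


# Allowed transitions. G1 -> S -> G2 -> M -> G1 and G0 -> G1 follow the text;
# G1 -> G0 (the exit point) is an assumption made for this sketch.
ALLOWED = {
    Phase.G0: {Phase.G1},
    Phase.G1: {Phase.S, Phase.G0},
    Phase.S: {Phase.G2},
    Phase.G2: {Phase.M},
    Phase.M: {Phase.G1},
}


def is_valid_progression(phases):
    """Return True if each consecutive pair of phases is an allowed transition."""
    return all(nxt in ALLOWED[cur] for cur, nxt in zip(phases, phases[1:]))
```

Under these assumptions, a progression that skips S phase, such as G1 followed directly by G2, is rejected, whereas a full round G1-S-G2-M-G1 is accepted.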

Proteins called cyclins are required for kinase activation. Cyclins are also 
important for targeting the kinase activity to a given subset of substrate(s). Other factors 
regulating CDK activity include CDK inhibitors (CKIs or ICKs, KIPs, CIPs, INKs), CDK
activating kinase (CAK) and CDK phosphatase (CDC25) (Mironov et al. (1999) Plant Cell
11:509-522 and Won K. et al. (1996) EMBO J. 15:4182-4193).

Summary of the Invention 

The present invention is based, at least in part, on the discovery of novel plant 
nucleic acid molecules and polypeptides encoded by such nucleic acid molecules, referred 
to herein as "cell cycle proteins" or "CCP." The CCP nucleic acid and polypeptide 
molecules of the present invention are useful as modulating agents in regulating cell cycle 
progression in, for example, plants. Accordingly, in one aspect, this invention provides 
isolated nucleic acid molecules encoding CCP polypeptides, as well as nucleic acid 
fragments suitable as primers or hybridization probes for the detection of CCP-encoding 
nucleic acids. 

In one embodiment, a CCP nucleic acid molecule of the invention is at least 60%, 
65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or more identical to the nucleotide sequence 
(e.g., to the entire length of the nucleotide sequence) of SEQ ID NO:1-66 or 228-239, or a
complement thereof. 
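
As a non-limiting sketch of how a percent identity threshold of this kind might be checked, the following computes identity over two sequences that have already been aligned to equal length, with "-" marking gaps. The function name percent_identity is hypothetical. Taking the full alignment length, gaps included, as the denominator is an assumption of this sketch and is not necessarily the convention used elsewhere in this specification.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must be pre-aligned to the same length; '-' marks a gap.
    The denominator is the full alignment length (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    # A column counts as a match only if the residues agree and are not gaps.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)
```

For example, two aligned ten-nucleotide sequences differing at a single position would score 90% under this definition and would therefore satisfy the 60% through 90% thresholds recited above, but not the 95% or 98% thresholds.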

In a preferred embodiment, the isolated nucleic acid molecule includes the 
nucleotide sequence shown in SEQ ID NO: 1-66 or 228-239, or a complement thereof. In 
another preferred embodiment, an isolated nucleic acid molecule of the invention encodes 
the amino acid sequence of a plant CCP polypeptide. 

Another embodiment of the invention features nucleic acid molecules, preferably 
CCP nucleic acid molecules, which specifically detect CCP nucleic acid molecules relative 
to nucleic acid molecules encoding non-CCP polypeptides. For example, in one 
embodiment, such a nucleic acid molecule is at least 15, 20, 25, 30, 40, 50, 100, 150, 200, 
250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, or 800 nucleotides in length and 
hybridizes under stringent conditions to a nucleic acid molecule comprising the nucleotide 
sequence shown in SEQ ID NO:1-66 or 228-239, or a complement thereof.

In other preferred embodiments, the nucleic acid molecule encodes a naturally 
occurring allelic variant of a plant CCP polypeptide, wherein the nucleic acid molecule
hybridizes to the nucleic acid molecule of SEQ ID NO:1-66 or 228-239 under stringent
conditions. 

Another embodiment of the invention provides an isolated nucleic acid molecule 
which is antisense to a CCP nucleic acid molecule, e.g., the coding strand of a CCP nucleic 
acid molecule.
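
For illustration, a sequence that is antisense to a coding strand is its reverse complement. The minimal sketch below assumes a DNA coding strand written 5' to 3' using only the letters A, C, G and T. The name antisense is hypothetical, and the example sequence in the usage note is invented rather than taken from the sequence listing.

```python
# Translation table mapping each DNA base to its Watson-Crick complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense(coding_strand: str) -> str:
    """Return the reverse complement of a DNA coding strand (both written 5' to 3')."""
    return coding_strand.translate(_COMPLEMENT)[::-1]
```

Applied to the invented coding sequence ATGGCC, the function returns GGCCAT.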

Another aspect of the invention provides a vector comprising a CCP nucleic acid 
molecule. In certain embodiments, the vector is a recombinant expression vector. In 
another embodiment, the invention provides a host cell containing a vector of the 
invention. The invention also provides a method for producing a CCP polypeptide, by
culturing in a suitable medium a host cell of the invention, e.g., a plant host cell such as a
host monocot plant cell (e.g., rice, wheat or corn) or a dicot host cell (e.g., Arabidopsis
thaliana, oilseed rape, or soybeans) containing a recombinant expression vector, such that 
the polypeptide is produced. 

Another aspect of this invention features isolated or recombinant CCP
polypeptides. In one embodiment, an isolated CCP polypeptide has one or more of the
following domains: a "cyclin destruction box", a "cyclin box motif 1", a "cyclin box motif 
2", a "CDC2 motif", a "CDK phosphorylation site", a "nuclear localization signal", a "Cy- 
like box", an "Rb binding domain", a "DEF domain", a "DNA binding domain", a "DCB1 
domain", a "DCB2 domain" and/or a "SAP domain". 

In a preferred embodiment, a CCP polypeptide includes at least one or more of the
following domains: a "cyclin destruction box", a "cyclin box motif 1", a "cyclin box motif
2", a "CDC2 motif", a "CDK phosphorylation site", a "nuclear localization signal", a "Cy-
like box", an "Rb binding domain", a "DEF domain", a "DNA binding domain", a "DCB1
domain", a "DCB2 domain" and/or a "SAP domain", and has an amino acid sequence at
least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more
identical to the amino acid sequence of SEQ ID NO:67-132, 205, 211, 215-216, or
220-227.

In another preferred embodiment, a CCP polypeptide includes at least one or more
of the following domains: a "cyclin destruction box", a "cyclin box motif 1", a "cyclin box
motif 2", a "CDC2 motif", a "CDK phosphorylation site", a "nuclear localization signal", a
"Cy-like box", an "Rb binding domain", a "DEF domain", a "DNA binding domain", a
"DCB1 domain", a "DCB2 domain" and/or a "SAP domain" and has a CCP activity (as
described herein).

In yet another preferred embodiment, a CCP polypeptide includes one or more of
the following domains: a "cyclin destruction box", a "cyclin box motif 1", a "cyclin box
motif 2", a "CDC2 motif", a "CDK phosphorylation site", a "nuclear localization signal", a
"Cy-like box", an "Rb binding domain", a "DEF domain", a "DNA binding domain", a
"DCB1 domain", a "DCB2 domain" and/or a "SAP domain" and is encoded by a nucleic acid
molecule having a nucleotide sequence which hybridizes under stringent hybridization
conditions to a nucleic acid molecule comprising the nucleotide sequence of SEQ ID
NO:1-66 or 228-239.

In another embodiment, the invention features fragments of the polypeptide having
the amino acid sequence of SEQ ID NO:67-132, 205, 211, 215-216, or 220-227, wherein
the fragment comprises at least 15 amino acids (e.g., contiguous amino acids) of the amino
acid sequence of SEQ ID NO:67-132, 205, 211, 215-216, or 220-227. In another
embodiment, a CCP polypeptide has the amino acid sequence of SEQ ID NO:67-132, 205,
211, 215-216, or 220-227.
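
The fragment criterion recited above (at least 15 contiguous amino acids of a reference sequence) can be expressed as a simple substring test. The sketch below is illustrative only: the name is_qualifying_fragment and the min_len parameter are hypothetical, and any sequence passed to it in testing is a placeholder rather than a sequence from the sequence listing.

```python
def is_qualifying_fragment(fragment: str, full_length: str, min_len: int = 15) -> bool:
    """Return True if `fragment` is a contiguous stretch of `full_length`
    that is at least `min_len` residues long (one-letter amino acid codes)."""
    return len(fragment) >= min_len and fragment in full_length
```

A 14-residue stretch of the reference sequence fails this test on length alone, while a 15-residue stretch that does not occur contiguously in the reference fails on content.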

In another embodiment, the invention features a CCP protein which is encoded by a
nucleic acid molecule consisting of a nucleotide sequence at least about 50%, 55%, 60%,
65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more identical to a nucleotide
sequence of SEQ ID NO:1-66 or 228-239, or a complement thereof. This invention further
features a CCP polypeptide, which is encoded by a nucleic acid molecule consisting of a
nucleotide sequence which hybridizes under stringent hybridization conditions to a nucleic
acid molecule comprising the nucleotide sequence of SEQ ID NO:1-66 or 228-239, or a
complement thereof.

In another embodiment, the invention provides transgenic plants (e.g., monocot or
dicot plants) containing an isolated nucleic acid molecule of the present invention. For
example, the invention provides transgenic plants containing a recombinant expression
cassette including a plant promoter operably linked to an isolated nucleic acid molecule of
the present invention. The present invention also provides transgenic seed from the
transgenic plants. In another embodiment, the invention provides methods of modulating,
in a transgenic plant, the expression of the nucleic acids of the invention.

The proteins of the present invention or portions thereof, e.g., biologically active
portions thereof, can be operatively linked to a non-CCP polypeptide (e.g., heterologous
amino acid sequences) to form fusion proteins. The invention further features antibodies,
such as monoclonal or polyclonal antibodies, that specifically bind a polypeptide of the
invention, preferably a CCP polypeptide. In addition, the CCP polypeptide or biologically
active portions thereof can be incorporated into pharmaceutical compositions, which
optionally include pharmaceutically acceptable carriers.

In another aspect, the present invention provides a method for detecting the
presence of a CCP nucleic acid molecule or polypeptide in a biological sample by contacting
the biological sample with an agent capable of detecting a CCP nucleic acid molecule or
polypeptide such that the presence of a CCP nucleic acid molecule or polypeptide is detected
in the biological sample.

In another aspect, the present invention provides a method for detecting the 
presence of CCP activity in a biological sample by contacting the biological sample with
an agent capable of detecting an indicator of CCP activity such that the presence of CCP
activity is detected in the biological sample. 

In another aspect, the invention provides a method for modulating CCP activity 
comprising contacting a cell capable of expressing CCP with an agent that modulates CCP 
activity such that CCP activity in the cell is modulated. In one embodiment, the agent
inhibits CCP activity. In another embodiment, the agent stimulates CCP activity. In one 
embodiment, the agent is an antibody that specifically binds to a CCP polypeptide. In 
another embodiment, the agent modulates expression of CCP by modulating transcription 
of a CCP gene or translation of a CCP mRNA. In yet another embodiment, the agent is a
nucleic acid molecule having a nucleotide sequence that is antisense to the coding strand of
a CCP mRNA or a CCP gene.

In one embodiment, the methods of the present invention are used to increase crop
yield, improve the growth characteristics of a plant (such as growth rate or size of specific
tissues or organs in the plant), modify the architecture or morphology of a plant, improve
tolerance to environmental stress conditions (such as drought, salt, temperature, or nutrient
deprivation), or improve tolerance to plant pathogens (e.g., pathogens that abuse the cell
cycle) by modulating CCP activity in a cell. In one embodiment, the CCP activity is
modulated by modulating the expression of a CCP nucleic acid molecule. In yet another
embodiment, the CCP activity is modulated by modulating the activity of a CCP
polypeptide. Modulators of CCP activity include, for example, a CCP nucleic acid or
polypeptide.

The present invention also provides diagnostic assays for identifying the presence 
or absence of a genetic alteration characterized by at least one of (i) aberrant modification 
or mutation of a gene encoding a CCP polypeptide; (ii) mis-regulation of the gene; and (iii)
aberrant post-translational modification of a CCP polypeptide, wherein a wild-type form of
the gene encodes a protein with a CCP activity. 

In another aspect the invention provides methods for identifying a compound that 
binds to or modulates the activity of a CCP polypeptide, by providing an indicator 
composition comprising a CCP polypeptide having CCP activity, contacting the indicator
composition with a test compound, and determining the effect of the test compound on
CCP activity in the indicator composition to identify a compound that modulates the 
activity of a CCP polypeptide. The identified compounds may be used as herbicides or 
plant growth regulators. 

Other features and advantages of the invention will be apparent from the following
detailed description and claims.




Brief Description of the Drawings 

Figure 1 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP1. The complete nucleotide sequence (Figure 1A) corresponds
to nucleic acids 1 to 1715 of SEQ ID NO:39. The complete amino acid sequence (Figure
1B) corresponds to amino acids 1 to 460 of SEQ ID NO:105. Underlined in Figure 1A and
Figure 1B are the partially characterized nucleotide (SEQ ID NO:1) and predicted partial
amino acid (SEQ ID NO:67) sequences, respectively. Further indicated in Figure 1A are
the stop and start codons (both in black shaded boxes) which are part of the primers (grey
shaded boxes) used to amplify the coding region of CCP1 by PCR. The SEQ ID NOs of
the primers used can be found in Table III. Indicated in Figure 1B are the cyclin
destruction box (black shaded box) and the cyclin box motifs 1 and 2 (both in gray shaded
boxes).

Figure 2 depicts the cDNA sequence of the Arabidopsis thaliana CCP2. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 2195 of SEQ ID NO:40. 
Underlined is the partially characterized nucleotide (SEQ ID NO:2) sequence. Nucleotide 
sequence differences between SEQ ID NO:40 and SEQ ID NO:2 are depicted. Indicated 
are the stop and start codons (both in black shaded boxes) which are part of the primers 
(grey shaded boxes) used to amplify the coding region of CCP2 by PCR. SEQ ID NOs of 
the primers used can be found in Table III. 

Figure 3 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP2. The complete amino acid sequence corresponds to amino acids 1 to 664 of SEQ ID 
NO:106. Underlined is the predicted partial amino acid (SEQ ID NO:68) sequence. 

Figure 4 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP3. The complete nucleotide sequence (Figure 4A) corresponds
to nucleic acids 1 to 1413 of SEQ ID NO:41. The complete amino acid sequence (Figure
4B) corresponds to amino acids 1 to 450 of SEQ ID NO:69. Underlined in Figure 4A and
Figure 4B are the partially characterized nucleotide (SEQ ID NO:3) and predicted partial
amino acid (SEQ ID NO:69) sequences, respectively. Indicated in Figure 4A are the stop
and start codons (both in black shaded boxes) which are part of the primers (grey shaded
boxes) used to amplify the coding region of CCP3 by PCR. SEQ ID NOs of the primers
used can be found in Table III. Nucleotide sequence differences between SEQ ID NO:41
and SEQ ID NO:3 are depicted. Indicated in Figure 4B are the cyclin destruction box (black
shaded box) and the cyclin box motifs 1 and 2 (both in gray shaded boxes).

Figure 5 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP4. The complete nucleotide sequence (Figure 5A) corresponds
to nucleic acids 1 to 672 of SEQ ID NO:4. The complete amino acid sequence (Figure 5B)
corresponds to amino acids 1 to 223 of SEQ ID NO:70. Indicated in Figure 5A are the stop
and start codons (both in black shaded boxes) which are part of the primers (grey shaded
boxes) used to amplify the coding region of CCP4 by PCR. SEQ ID NOs of the primers
used can be found in Table III. Indicated in Figure 5B is the CDK phosphorylation site
(black shaded box).

Figure 6 depicts the cDNA sequence and predicted amino acid sequence of the 
Arabidopsis thaliana CCP5. The complete nucleotide sequence (Figure 6A) corresponds 
to nucleic acids 1 to 1287 of SEQ ID NO:5. The complete amino acid sequence (Figure 
6B) corresponds to amino acids 1 to 429 of SEQ ID NO:71. Indicated in Figure 6A are the
stop and start codons (both in black shaded boxes) which are part of the primers (grey
shaded boxes) used to amplify the coding region of CCP5 by PCR. SEQ ID NOs of the 
primers used can be found in Table III. Indicated in Figure 6B are the cyclin destruction 
box (black shaded box) and the cyclin box motifs 1 and 2 (both in gray shaded boxes). 

Figure 7 depicts the cDNA sequence of the Arabidopsis thaliana CCP6. The
complete nucleotide sequence corresponds to nucleic acids 1 to 2766 of SEQ ID NO:42.
Underlined is the partially characterized nucleotide (SEQ ID NO:6) sequence. Indicated
are the stop and start codons (both in black shaded boxes) which are part of the primers
(grey shaded boxes) used to amplify the coding region of CCP6 by PCR. SEQ ID NOs of
the primers used can be found in Table III. Nucleotide sequence differences between SEQ
ID NO:42 and SEQ ID NO:6 are depicted.

Figure 8 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP6. The complete amino acid sequence corresponds to amino acids 1 to 901 of SEQ 
ID NO:108. Underlined is the predicted partial amino acid (SEQ ID NO:72) sequence.

Figure 9 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP7/CCP8. The complete nucleotide sequence (Figure 9A)
corresponds to nucleic acids 1 to 1260 of SEQ ID NO:43. The complete amino acid
sequence (Figure 9B) corresponds to amino acids 1 to 358 of SEQ ID NO:109. Underlined
in Figure 9A and Figure 9B are the partially characterized nucleotide (SEQ ID NO:7) and
predicted partial amino acid (SEQ ID NO:73) sequences, respectively. Italic sequences in
Figure 9A and Figure 9B correspond to the partially characterized nucleotide (SEQ ID
NO:8) and amino acid (SEQ ID NO:74) sequences, respectively, of another clone found
independently to interact with an AtE2F protein in a yeast two-hybrid screen. Indicated in
Figure 9A are the stop and start codons (both in black shaded boxes) which are part of the
primers (grey shaded boxes) used to amplify the coding region of CCP7/8 by PCR. SEQ
ID NOs of the primers used can be found in Table III. Nucleotide sequence differences
between SEQ ID NO:43 and SEQ ID NO:7-8 are depicted.

Figure 10 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP9. The complete nucleotide sequence (Figure 10A) corresponds
to nucleic acids 1 to 1308 of SEQ ID NO:9. The complete amino acid sequence (Figure
10B) corresponds to amino acids 1 to 436 of SEQ ID NO:75. Indicated in Figure 10A are
the stop and start codons (both in black shaded boxes) which are part of the primers (grey
shaded boxes) used to amplify the coding region of CCP9 by PCR. SEQ ID NOs of the
primers used can be found in Table III. Indicated in Figure 10B are the cyclin destruction
box (black shaded box) and the cyclin box motifs 1 and 2 (both in gray shaded boxes).

Figure 11 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP10. The complete nucleotide sequence (Figure 11A)
corresponds to nucleic acids 1 to 1006 of SEQ ID NO:10. The complete amino acid
sequence (Figure 11B) corresponds to amino acids 1 to 254 of SEQ ID NO:76. Indicated in
Figure 11A are the stop and start codons (both in black shaded boxes) which are part of the
primers (grey shaded boxes) used to amplify the coding region of CCP10 by PCR. SEQ ID
NOs of the primers used can be found in Table III.

Figure 12 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP11. The complete nucleotide sequence (Figure 12A)
corresponds to nucleic acids 1 to 653 of SEQ ID NO:44. Indicated in Figure 12A are the
stop and start codons (both in black shaded boxes) which are part of the primers (grey
shaded boxes) used to amplify the coding region of CCP11 by PCR. SEQ ID NOs of the
primers used can be found in Table III. However, during prediction of the open reading
frame a frame shift was introduced which affected the CCP11 open reading frame. The
stop codon indicated in italics in a black shaded box is the putative correct stop codon.
The amino acid sequence in Figure 12B corresponds to amino acids 1 to 86 of SEQ ID
NO:77, the protein encoded by the initially identified open reading frame of SEQ ID
NO:11. The putative correct complete amino acid sequence in Figure 12C corresponds to
amino acids 1 to 98 of SEQ ID NO:110.

Figure 13 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP12/13. The complete nucleotide sequence (Figure 13A)
corresponds to nucleic acids 1 to 1266 of SEQ ID NO:45. The complete amino acid
sequence (Figure 13B) corresponds to amino acids 1 to 385 of SEQ ID NO:111. Double
underlined in Figure 13A and Figure 13B are the partially characterized 3' nucleotide
(SEQ ID NO:12) and C-terminal predicted partial amino acid (SEQ ID NO:78) sequences,
respectively. Single underlined in Figure 13A and Figure 13B are the partially
characterized 5' nucleotide (SEQ ID NO:13) and N-terminal predicted partial amino acid
(SEQ ID NO:79) sequences, respectively. Indicated in Figure 13A are the stop and start
codons (both in black shaded boxes) and the primers (grey shaded boxes) used to amplify
the coding region of CCP12/13 by PCR. SEQ ID NOs of the primers used can be found in
Table III. Nucleotide sequence differences between SEQ ID NO:45 and SEQ ID NO:12 are
depicted.

Figure 14 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP14. The complete nucleotide sequence (Figure 14A)
corresponds to nucleic acids 1 to 1520 of SEQ ID NO:46. The complete amino acid
sequence (Figure 14B) corresponds to amino acids 1 to 465 of SEQ ID NO:112.
Underlined in Figure 14A and Figure 14B are the partially characterized nucleotide (SEQ
ID NO:14) and predicted partial amino acid (SEQ ID NO:80) sequences, respectively.
Indicated in Figure 14A are the stop and start codons (both in black shaded boxes) which
are part of the primers (grey shaded boxes) used to amplify the coding region of CCP14 by
PCR. SEQ ID NOs of the primers used can be found in Table III.

Figure 15 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP15. The complete nucleotide sequence (Figure 15A)
corresponds to nucleic acids 1 to 1142 of SEQ ID NO:47. The complete amino acid
sequence (Figure 15B) corresponds to amino acids 1 to 313 of SEQ ID NO:113. Underlined
in Figure 15A and Figure 15B are the partially characterized nucleotide (SEQ ID NO:15)
and predicted partial amino acid (SEQ ID NO:81) sequences, respectively. Indicated in
Figure 15A are the stop and start codons (both in black shaded boxes) which are part of the
primers (grey shaded boxes) used to amplify the coding region of CCP15 by PCR. SEQ
ID NOs of the primers used can be found in Table III. Nucleotide sequence differences
between SEQ ID NO:47 and SEQ ID NO:15 are depicted. Indicated in Figure 15B is the
PSTTLRE motif (boxed) characteristic for the subclass of plant PSTTLRE CDC2 kinases.
Further indicated in Figure 15B are three CDC2 motifs (black shaded box, grey shaded box
and double underlined). Other residues conserved in CDC2s are underscored by '**
(residues in common with ProDom domain PD198850), (residues in common with
ProDom domain PD015684), (residues in common with ProDom domain PD063669),
and ' 1 ' (residues in common with ProDom domain PD195780).

Figure 16 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP16. The complete nucleotide sequence (Figure 16A)
corresponds to nucleic acids 1 to 1189 of SEQ ID NO:48. The complete amino acid
sequence (Figure 16B) corresponds to amino acids 1 to 292 of SEQ ID NO:114. Indicated
in Figure 16A are the stop and the three possible start codons (all in black shaded boxes)
and the primers (grey shaded boxes) used to amplify the coding region of CCP16 by PCR.
SEQ ID NOs of the primers used can be found in Table III. Nucleotide sequence
differences between SEQ ID NO:48 and SEQ ID NO:16 are depicted. Indicated in Figure
16B are the DNA binding domain (black shaded box), DEF domain (grey shaded box),
DCB1 domain (single underlined) and DCB2 domain (double underlined), all domains
characteristic for a DP protein.

Figure 17 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP17. The complete nucleotide sequence (Figure 17A)
corresponds to nucleic acids 1 to 794 of SEQ ID NO:17. The complete amino acid
sequence (Figure 17B) corresponds to amino acids 1 to 173 of SEQ ID NO:83. Indicated in
Figure 17A are the stop and start codons (both in black shaded boxes) which are part of the
primers (grey shaded boxes) used to amplify the coding region of CCP17 by PCR. SEQ
ID NOs of the primers used can be found in Table III.

Figure 18 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP18. The complete nucleotide sequence (Figure 18A)
corresponds to nucleic acids 1 to 805 of SEQ ID NO:49. The complete amino acid
sequence (Figure 18B) corresponds to amino acids 1 to 165 of SEQ ID NO:115.
Underlined in Figure 18A and Figure 18B are the partially characterized nucleotide (SEQ
ID NO:18) and predicted partial amino acid (SEQ ID NO:84) sequences, respectively.
Indicated in Figure 18A are the stop and start codons (both in black shaded boxes) which
are part of the primers (grey shaded boxes) used to amplify the coding region of CCP18 by
PCR. SEQ ID NOs of the primers used can be found in Table III.

Figure 19 depicts the cDNA sequence and predicted amino acid sequence of the
Arabidopsis thaliana CCP19. The complete nucleotide sequence (Figure 19A)
corresponds to nucleic acids 1 to 1152 of SEQ ID NO:19. The complete amino acid
sequence (Figure 19B) corresponds to amino acids 1 to 383 of SEQ ID NO:85. Indicated in
Figure 19A are the stop and start codons (both in black shaded boxes) which are part of the
primers (grey shaded boxes) used to amplify the coding region of CCP19 by PCR. SEQ
ID NOs of the primers used can be found in Table III.

Figure 20 depicts the cDNA sequence of the Arabidopsis thaliana CCP20/21. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 1539 of SEQ ID NO:50. 
Underlined are the partially characterized 5' nucleotide (SEQ ID NO:20) sequence and the 
partially characterized 3' nucleotide (SEQ ID NO:21) sequence. Indicated in Figure 20 are the stop 
and start codons (both in black shaded boxes) which are part of the primers (grey shaded 
boxes) used to amplify the coding region of CCP20/21 by PCR. SEQ ID NOs of the 
primers used can be found in Table III. Nucleotide sequence differences between SEQ ID 
NOs:20-21 and SEQ ID NO:50 are depicted. 

Figure 21 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP20/21. The complete amino acid sequence corresponds to amino acids 1 to 432 of 
SEQ ID NO:116. Underlined are the partially characterized N-terminal predicted partial 
amino acid (SEQ ID NO:50) sequence and the partially characterized C-terminal 
predicted partial amino acid (SEQ ID NO:87) sequence. Indicated are further differences in 
amino acid sequence between SEQ ID NO:87 and SEQ ID NO:116. 

Figure 22 depicts the cDNA sequence of the Arabidopsis thaliana CCP22. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 1977 of SEQ ID NO:51. 
Underlined is the partially characterized nucleotide (SEQ ID NO:22) sequence. Indicated are the 
stop and start codons (both in black shaded boxes) which are part of the primers (grey 
shaded boxes) used to amplify the coding region of CCP22 by PCR. SEQ ID NOs of the 
primers used can be found in Table III. 

Figure 23 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP22. The complete amino acid sequence corresponds to amino acids 1 to 559 of SEQ 
ID NO:117. Underlined is the predicted partial amino acid (SEQ ID NO:88) sequence. 

Figure 24 depicts the cDNA sequence and predicted amino acid sequence of the 
Arabidopsis thaliana CCP23. The complete nucleotide sequence (Figure 24A) 
corresponds to nucleic acids 1 to 525 of SEQ ID NO:52. Indicated in Figure 24A are the 
stop and start codons (both in black shaded boxes) which are part of the primers (grey 
shaded boxes) used to amplify the coding region of CCP23 by PCR. SEQ ID NOs of the 
primers used can be found in Table III. Nucleotide sequence differences between SEQ ID 
NO:23 and SEQ ID NO:52 are depicted. The amino acid sequence in Figure 24B 
corresponds to amino acids 1 to 98 of SEQ ID NO:89. The complete amino acid sequence 
in Figure 24C corresponds to amino acids 1 to 86 of SEQ ID NO:118. 

Figure 25 depicts the cDNA sequence of the Arabidopsis thaliana CCP24. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 2610 of SEQ ID NO:53. 
Underlined is the partially characterized nucleotide (SEQ ID NO:24). Indicated are the 
stop and start codons (both in black shaded boxes) which are part of the primers (grey 
shaded boxes) used to amplify the coding region of CCP24 by PCR. SEQ ID NOs of the 
primers used can be found in Table III. 

Figure 26 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP24. The complete amino acid sequence corresponds to amino acids 1 to 784 of SEQ 
ID NO:119. Underlined is the predicted partial amino acid (SEQ ID NO:90) sequence. 

Figure 27 depicts the cDNA sequence of the Arabidopsis thaliana CCP25. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 2235 of SEQ ID NO:54. 
Underlined is the partially characterized nucleotide (SEQ ID NO:25) sequence. Indicated 
are the stop and start codons (both in black shaded boxes) which are part of the primers (grey 
shaded boxes) used to amplify the coding region of CCP25 by PCR. SEQ ID NOs of the 
primers used can be found in Table III. 

Figure 28 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP25. The complete amino acid sequence corresponds to amino acids 1 to 724 of SEQ 
ID NO: 120. Underlined is the predicted partial amino acid (SEQ ID NO:91) sequence. 

Figure 29 depicts the cDNA sequence of the Arabidopsis thaliana CCP26. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 4002 of SEQ ID NO: 55. 
Underlined is the partially characterized nucleotide (SEQ ID NO:26) sequence. Indicated 
are the stop and start codons (both in black shaded boxes) which are part of the primers (grey 
shaded boxes) used to amplify the coding region of CCP26 by PCR. SEQ ID NOs of the 
primers used can be found in Table III. Nucleotide sequence differences between SEQ ID 
NO:26 and SEQ ID NO:55 are depicted. 

Figure 30 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP26. The complete amino acid sequence corresponds to amino acids 1 to 1313 of SEQ 
ID NO:121. Underlined is the predicted partial amino acid (SEQ ID NO:92) sequence. 
Amino acid sequence differences between SEQ ID NO:92 and SEQ ID NO:121 are 
depicted. 

Figure 31 depicts the cDNA sequence and predicted amino acid sequence of the 
Arabidopsis thaliana CCP27. The complete nucleotide sequence (Figure 31A) 
corresponds to nucleic acids 1 to 1251 of SEQ ID NO:56. The complete amino acid 
sequence (Figure 31B) corresponds to amino acids 1 to 310 of SEQ ID NO:122. 
Underlined in Figure 31A and Figure 31B are the partially characterized nucleotide (SEQ 
ID NO:27) and predicted partial amino acid (SEQ ID NO:93) sequence, respectively. 
Indicated in Figure 31A are the stop and start codons (both in black shaded boxes) which 
are part of the primers (grey shaded boxes) used to amplify the coding region of CCP27 by 
PCR. SEQ ID NOs of the primers used can be found in Table III. Nucleotide sequence 
differences between SEQ ID NO:27 and SEQ ID NO:56 are depicted in Figure 31A. 

Figure 32 depicts the cDNA sequence of the Arabidopsis thaliana CCP28. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 2955 of SEQ ID NO:57. 
Underlined is the partially characterized nucleotide (SEQ ID NO:28) sequence. Indicated 
are the stop and start codons (both in black shaded boxes) which are part of the primers 
(grey shaded boxes) used to amplify the coding region of CCP28 by PCR. SEQ ID NOs of 
the primers used can be found in Table III. Nucleotide sequence differences between SEQ 
ID NO:28 and SEQ ID NO:57 are depicted. 

Figure 33 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP28. The complete amino acid sequence corresponds to amino acids 1 to 964 of SEQ 
ID NO:123. Underlined is the predicted partial amino acid (SEQ ID NO:94) sequence. 

Figure 34 depicts the cDNA sequence and predicted amino acid sequence of the 
Arabidopsis thaliana CCP29. The complete nucleotide sequence (Figure 34A) 
corresponds to nucleic acids 1 to 546 of SEQ ID NO:29. The complete amino acid 
sequence (Figure 34B) corresponds to amino acids 1 to 181 of SEQ ID NO:95. Indicated in 
Figure 34A are the stop and start codons (both in black shaded boxes) which are part of the 
primers (grey shaded boxes) used to amplify the coding region of CCP29 by PCR. SEQ ID 
NOs of the primers used can be found in Table III. 

Figure 35 depicts the cDNA sequences and predicted amino acid sequences of the 
Arabidopsis thaliana CCP30. The complete nucleotide sequence (Figure 35A) 
corresponds to nucleic acids 1 to 492 of SEQ ID NO:30. Indicated in Figure 35A are the 
stop and start codons (both in black shaded boxes), the complete sense primer and part of 
the antisense primer (grey shaded boxes) used to amplify the coding region of CCP30 by 
PCR. SEQ ID NOs of the primers used can be found in Table III. However, after 
sequencing of the PCR product a sequence error in SEQ ID NO:30 was detected (boxed 
nucleotide 'a' in Figure 35A not present) which caused a frame shift affecting the 
CCP30 open reading frame. The putative correct cDNA sequence is given in Figure 35B 
(nucleic acids 1 to 865 of SEQ ID NO:58) wherein the three putative start codons are 
marked by a black shaded box. The originally identified start codon is indicated in bold 
letters. The stop codon is unaltered. The amino acid sequence in Figure 35C corresponds 
to amino acids 1 to 163 of SEQ ID NO:96, the protein encoded by the initially identified 
open reading frame of SEQ ID NO:30. The putative correct complete amino acid sequence 
in Figure 35D corresponds to amino acids 1 to 222 of SEQ ID NO:124 which comprises 
the longest possible open reading frame. The Met residues corresponding to the three 
possible start codons in SEQ ID NO:58 (Figure 35B) are bold faced. 
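
The lengths quoted above can be cross-checked with simple codon arithmetic. The following is a minimal sketch, assuming that a quoted nucleotide sequence runs exactly from the start codon to the stop codon, so that its length equals three times the number of residues plus one stop codon. The helper names and the toy sequence are hypothetical and are not taken from the sequence listing.

```python
# Hypothetical helper: does a cDNA length fit a protein length, assuming the
# sequence spans exactly the start codon through the stop codon?
def orf_length_matches(nucleotides: int, amino_acids: int) -> bool:
    # One codon (3 nt) per residue, plus one codon for the stop.
    return nucleotides == 3 * (amino_acids + 1)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, reading from position 0."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# Lengths quoted in the descriptions of Figures 19, 34 and 35.
quoted = {
    "CCP19": (1152, 383),
    "CCP29": (546, 181),
    "CCP30 (initially identified ORF)": (492, 163),
}
for name, (nt, aa) in quoted.items():
    print(name, orf_length_matches(nt, aa))

# Toy illustration of a frame shift (made-up sequence, not SEQ ID NO:30):
# removing one nucleotide moves every downstream codon out of frame.
toy = "ATGGCATTTTGA"            # ATG GCA TTT TGA
shifted = toy[:5] + toy[6:]     # the 'A' at index 5 is missing
print(codons(toy))
print(codons(shifted))
```

Sequences that include untranslated regions, such as the 794 nucleotides quoted for CCP17, are not expected to satisfy this relation.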

Figure 36 depicts the cDNA sequence of the Arabidopsis thaliana CCP31. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 723 of SEQ ID NO:31. 
Indicated in Figure 36 are the stop and start codons (both in black shaded boxes). 

Figure 37 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP31. The complete amino acid sequence corresponds to amino acids 1 to 148 of SEQ 
ID NO:125. 

Figure 38 depicts the cDNA sequence and predicted amino acid sequence of the 
Arabidopsis thaliana CCP32. The complete nucleotide sequence (Figure 38A) 
corresponds to nucleic acids 1 to 426 of SEQ ID NO: 60. The complete amino acid 
sequence (Figure 38B) corresponds to amino acids 1 to 70 of SEQ ID NO:126. Underlined 
in Figure 38A is the partially characterized nucleotide (SEQ ID NO:32) sequence. 
Indicated in Figure 38A are the stop and start codons (both in black shaded boxes) which 
are part of the primers (grey shaded boxes) used to amplify the coding region of CCP32 by 
PCR. SEQ ID NOs of the primers used can be found in Table III. Figure 38C gives the 
originally erroneously predicted amino acid sequence of CCP32 (amino acids 1 to 38 of 
SEQ ID NO:98). 

Figure 39 depicts the cDNA sequence and predicted amino acid sequence of the 
Arabidopsis thaliana CCP33. The complete nucleotide sequence (Figure 39A) 
corresponds to nucleic acids 1 to 1442 of SEQ ID NO:61. The complete amino acid 
sequence (Figure 39B) corresponds to amino acids 1 to 385 of SEQ ID NO:127. Indicated 
in Figure 39A are the stop and start codons (both in black shaded boxes) which are part of 
the primers (grey shaded boxes) used to amplify the coding region of CCP33 by PCR. SEQ 
ID NOs of the primers used can be found in Table III. Indicated in Figure 39B are the 
DNA binding domain (black shaded box), DEF domain (grey shaded box), DCB1 domain 
(single underlined) and DCB2 domain (double underlined), all domains characteristic for a 
DP protein. 

Figure 40 depicts the cDNA sequence and predicted amino acid sequence of the 
Arabidopsis thaliana CCP34. The complete nucleotide sequence (Figure 40A) 
corresponds to nucleic acids 1 to 1506 of SEQ ID NO:62. The complete amino acid 
sequence (Figure 40B) corresponds to amino acids 1 to 437 of SEQ ID NO:128. 
Underlined in Figure 40A and Figure 40B are the partially characterized nucleotide (SEQ 
ID NO:34) and predicted partial amino acid (SEQ ID NO:62) sequence, respectively. 
Indicated in Figure 40A are the stop and start codons (both in black shaded boxes) which 
are part of the primers (grey shaded boxes) used to amplify the coding region of CCP34 by 
PCR. SEQ ID NOs of the primers used can be found in Table III. 

Figure 41 depicts the cDNA sequence of the Arabidopsis thaliana CCP35. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 2631 of SEQ ID NO:63. 
Underlined is the partially characterized nucleotide (SEQ ID NO:35) sequence. Indicated 
are the stop and start codons (both in black shaded boxes) and the primers (grey shaded 
boxes) used to amplify the coding region of CCP35 by PCR. SEQ ID NOs of the primers 
used can be found in Table III. Nucleotide sequence differences between SEQ ID NO:35 
and SEQ ID NO:63 are depicted. 

Figure 42 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP35. The complete amino acid sequence corresponds to amino acids 1 to 749 of SEQ 
ID NO: 129. Underlined is the predicted partial amino acid (SEQ ID NO:101) sequence. 

Figure 43 depicts the cDNA sequence of the Arabidopsis thaliana CCP36. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 2743 of SEQ ID NO:64. 
Underlined is the partially characterized nucleotide (SEQ ID NO:36) sequence. Indicated 
are the stop and start codons (both in black shaded boxes). Nucleotide sequence differences 
between SEQ ID NO:36 and SEQ ID NO:64 are depicted. 

Figure 44 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP36. The complete amino acid sequence corresponds to amino acids 1 to 742 of SEQ 
ID NO:130. Underlined is the predicted partial amino acid (SEQ ID NO:102) sequence. 

Figure 45 depicts the cDNA sequence of the Arabidopsis thaliana CCP37. The 
complete nucleotide sequence corresponds to nucleic acids 1 to 2959 of SEQ ID NO:65. 
Underlined is the partially characterized nucleotide (SEQ ID NO:37) sequence. Indicated 
are the stop and start codons (both in black shaded boxes) and primers (grey shaded boxes) 
used to amplify the coding region of CCP37 by PCR. SEQ ID NOs of the primers used can 
be found in Table III. 

Figure 46 depicts the predicted amino acid sequence of the Arabidopsis thaliana 
CCP37. The complete amino acid sequence corresponds to amino acids 1 to 911 of SEQ 
ID NO:131. Underlined is the predicted partial amino acid (SEQ ID NO:103) sequence. 
Indicated in a black shaded box is a SAP-like domain. 

Figure 47 depicts the cDNA sequence and predicted amino acid sequence of the 
Arabidopsis thaliana CCP38. The complete nucleotide sequence (Figure 47A) 
corresponds to nucleic acids 1 to 1295 of SEQ ID NO:66. The complete amino acid 
sequence (Figure 47B) corresponds to amino acids 1 to 357 of SEQ ID NO:132. 
Underlined in Figure 47A and Figure 47B are the partially characterized nucleotide (SEQ 
ID NO:38) and predicted partial amino acid (SEQ ID NO:104) sequence, respectively. 
Indicated in Figure 47A are the stop and start codons (both in black shaded boxes) which 
are part of the primers (grey shaded boxes) used to amplify the coding region of CCP38 by 
PCR. SEQ ID NOs of the primers used can be found in Table III. 

Figure 48 depicts phosphorylation of the Arabidopsis thaliana CCP4 by CDKs. 
The protein CDC2bDN-IC26M (SEQ ID NO:70) contains a consensus CDK 
phosphorylation site (TPWK, residues 54-57 of SEQ ID NO:263). The corresponding gene 
(SEQ ID NO:4) was expressed in E. coli and the protein was purified from the crude 
extracts. The purified protein was subsequently shown to be phosphorylated by CDKs in 
an in vitro CDK phosphorylation assay. -: no IC26M added; +: IC26M added. 
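
A site such as TPWK can be located in a protein sequence with a simple pattern search. The sketch below assumes the commonly cited minimal CDK consensus of Ser/Thr, Pro, any residue, Lys/Arg; that pattern is an assumption of this example and is not defined in the present text, which names only TPWK. The function name and the test peptide are hypothetical.

```python
import re

# Assumed minimal CDK consensus: [S/T] - P - X - [K/R].
# The lookahead lets overlapping candidate sites be reported as well.
CDK_SITE = re.compile(r"(?=([ST]P.[KR]))")


def find_cdk_sites(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched motif) for each candidate site."""
    return [(m.start() + 1, m.group(1)) for m in CDK_SITE.finditer(protein.upper())]


# Made-up peptide used only to exercise the function.
print(find_cdk_sites("MAATPWKLL"))
```

A match only flags a candidate site; whether it is phosphorylated has to be shown experimentally, as in the in vitro assay of Figure 48.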

Figure 49 schematically represents the domain organization of AtE2Fa and 
AtE2Fb. The DNA-binding domain (DB), the dimerization domain (DIM), the marked box 
(MB), and the Rb-binding domain (RB) are indicated by marked boxes, the N-terminal 
domains are indicated by open boxes. Numbering on the right refers to the amino acid 
sequence contained in the different AtE2F constructs, which were used in the in vitro 
binding assays. 

Figure 50 depicts AtDPa in vitro interactions with AtE2Fa and AtE2Fb. The 
c-myc-tagged AtDPa (c-myc-AtDPa) was in vitro translated and used as a control. The lower 
migrating proteins observed in the case of c-myc-AtDPa are most probably due to initiation 
of translation at internal methionine codons (panel A, unnumbered left lane). The 
c-myc-AtDPa was in vitro co-translated with HA-AtE2Fb (panels A and B, lane 1), HA-AtE2Fa 
(panels B, lane 2), the C-terminal deleted form of HA-AtE2Fb (panels A and B, lane 3), 
HA-AtE2Fa 1-420 (panels A and B, lane 4) and the N-terminal truncated form of 
HA-AtE2Fa 162-485 (panels A and B, lane 5) as indicated. Numbers in the case of the mutant 
AtE2Fs refer to the amino acid sequence contained in these constructs (see Figure 49). An 
aliquot of each sample was analyzed directly by SDS-PAGE and autoradiographed (panel 
A; total IVT, total in vitro translation). Another aliquot of the same samples was subjected 
to immunoprecipitation with anti-c-myc monoclonal antibodies (panel B); lanes are 
indicated by numbering. The positions of the c-myc-AtDPa proteins are marked by arrows in 
both panels. Molecular mass markers are indicated at the left. 

Figure 51 shows AtDPb in vitro interactions with AtE2Fa and AtE2Fb. The 
c-myc-tagged AtDPb (c-myc-AtDPb, panels A and B, lane 2) and the HA-tagged AtE2Fb 
(HA-AtE2Fb, panels A and B, lane 1) were in vitro translated and used as controls. The lower 
migrating proteins observed in the case of c-myc-AtDPb are most probably due to initiation 
of translation at internal methionine codons (panel A, lane 2). The c-myc-AtDPb was in 
vitro co-translated with HA-AtE2Fb (panels A and B, lane 3), HA-AtE2Fa (panels A and 
B, lane 4), HA-AtE2Fa 1-420 (panels A and B, lane 5) and the N-terminal truncated form 
of HA-AtE2Fa 162-485 (panels A and B, lane 6) as indicated. Numbers in the case of the 
mutant AtE2Fs refer to the amino acid sequence contained in these constructs (see Figure 
49). An aliquot of each sample was analyzed directly by SDS-PAGE and autoradiographed 
(panel A; total IVT, total in vitro translation). Another aliquot of the same samples was 
subjected to immunoprecipitation with anti-c-myc monoclonal antibodies (panel B), lanes 
are indicated by numbering. The c-myc-AtDPb (panels A and B, lanes 2-6; indicated with 
'y') co-migrated almost exactly with the mutant HA-AtE2Fa 1-420 (panels A and B, lane 
5; indicated with 'x') in the gel system. These polypeptides as well as the position of 
c-myc-AtDPa and c-myc-AtDPb proteins are marked by arrows marked with 'y', 'x' and 'z', 
respectively (cfr. supra). Molecular mass markers are indicated at the left. 

Figure 52 schematically represents AtDPa and mutants. The DNA-binding domain 
(DB) and the dimerization domain (DIM) are indicated by marked boxes, N- and C- 
terminal regions are indicated by open boxes. Numbering on the right side refers to the 
amino acid sequence contained in the different AtDP constructs, which were used in the in 
vitro binding assays. 

Figure 53 schematically represents AtDPb and mutants. The DNA-binding domain 
(DB) and the dimerization domain (DIM) are indicated by marked boxes, N- and C- 
terminal regions are indicated by open boxes. Numbering on the right side refers to the 
amino acid sequence contained in the different AtDP constructs, which were used in the in 
vitro binding assays. 

Figure 54 shows the mapping of regions in AtDPa required for in vitro binding to 
AtE2Fb. HA-AtE2Fb was co-translated with a series of c-myc-AtDPa mutants. An aliquot of 
each sample was analyzed directly by SDS-PAGE and autoradiographed (panel A). 
Another aliquot of the same samples was subjected to immunoprecipitation with anti-HA 
(panel B) or anti-c-myc (panel C) monoclonal antibodies. The c-myc-AtDPa mutants are 
marked by dots. Positions of the HA-AtE2Fb proteins are indicated by arrows. Molecular 
mass markers are indicated at the left. 

Figure 55 shows the mapping of regions in AtDPb required for in vitro binding to 
AtE2Fb. HA-AtE2Fb was co-translated with a series of c-myc-AtDPb mutants. An aliquot of 
each sample was analyzed directly by SDS-PAGE and autoradiographed (panel A). 
Another aliquot of the same samples was subjected to immunoprecipitation with anti-HA 
(panel B) or anti-c-myc (panel C) monoclonal antibodies. The c-myc-AtDPb mutants are 
marked by dots. Positions of the HA-AtE2Fb proteins are indicated by arrows. Molecular 
mass markers are indicated at the left. 

Figure 56 shows the mapping of regions in AtDPb required for in vitro binding to 
AtE2Fb. HA-AtE2Fb was co-translated with c-myc-AtDPb 182-263. Because of the small 
size of this protein, it was hardly detectable when it was directly analyzed by SDS-PAGE 
(data not shown). An aliquot of this sample was subjected to immunoprecipitation with 
anti-c-myc monoclonal antibodies. The c-myc-AtDP mutant is marked by dots. Position of 
the HA-AtE2Fb protein is indicated by an arrow. Molecular mass markers are indicated at 
the left. 

Figure 57 shows organ- and cell cycle-specific expression of AtE2Fa and AtDPa. 
Tissue-specific expression of AtDPa and AtE2Fa genes. cDNA prepared from the 
indicated tissues was subjected to semi-quantitative RT-PCR analysis. The 
Arath;CDKB1;1 gene was used as a marker for highly proliferating tissues. The actin 2 
gene (ACT2) was used as loading control. 

Figure 58 shows organ- and cell cycle-specific expression of AtE2Fa and AtDPa. 
Co-regulated cell cycle phase-dependent transcription of AtE2Fa and AtDPa. cDNA 
prepared from partially synchronized Arabidopsis cells harvested at the indicated time 
points after removal of the cell cycle blocker was subjected to semi-quantitative RT-PCR 
analysis. Histone H4 and Arath;CDKB1;1 were used as markers for S and G2/M phase, 
respectively, and ROC5 and Arath;CDKA;1 as loading controls. 

Figure 59 is a photographic representation of Northern blotting analysis of DPa 
expression in independent Arabidopsis thaliana DPa overexpressing lines (lines 16-27 as 
indicated) and one untransformed control line (indicated by C). 

Figure 60 describes the molecules defined in SEQ ID NOs:199-204 and 240-290. 

Detailed Description of the Invention 

The present invention is based, at least in part, on the discovery of novel molecules, 
referred to herein as "cell cycle proteins" or "CCP" nucleic acid and polypeptide 
molecules. The CCP molecules of the present invention were identified based on their 
ability, as determined using yeast two-hybrid assays (described in detail in Example 1), to 
interact with proteins involved in the cell cycle, such as plant cyclin dependent kinases 
(e.g., a dominant negative form of CDC2b, CDC2bAt.N161), cyclin dependent kinase 
subunits referred to herein as "CKS" (such as CKS1At), cyclin dependent kinase inhibitors 
referred to herein as "CKI" (such as CKI4), PHO80-like proteins referred to herein as 
"PLP", E2F, and different domains of kinesin-like proteins referred to herein as "KLPNT". 

Because of their ability to interact with (e.g., bind to) the cyclin dependent 
kinases, the CCP molecules of the present invention may modulate, e.g., upregulate or 
downregulate, the activity of plant CDKs, such as CDC2a or CDC2b; CKSs, CKIs, PLPs 
and KLPNTs. Furthermore, because of their ability to interact with (e.g., bind to) the 
aforementioned proteins which are proteins involved in cell cycle regulation, the CCP 
molecules of the present invention may also play a role in or function in cell cycle 
regulation, e.g., plant or animal cell cycle regulation. 

As used herein, the term "cell cycle protein" includes a polypeptide which is 
involved in controlling or regulating the cell cycle, or part thereof, in a cell, tissue, organ 
or whole organism. Cell cycle proteins may also be capable of binding to, regulating, or 
being regulated by cyclin dependent kinases, such as plant cyclin dependent kinases, e.g., 
CDC2a or CDC2b, or their subunits. The term cell cycle protein also includes peptides, 
polypeptides, fragments, variants, homologs, alleles or precursors (e.g., pre-proteins or 
pro-proteins) thereof. 

As used herein, the term "cell cycle" includes the cyclic biochemical and structural 
events associated with growth, division and proliferation of cells, and in particular with the 
regulation of the replication of DNA and mitosis. The cell cycle is divided into periods 
called: G0, Gap 1 (G1), DNA synthesis (S), Gap 2 (G2), and mitosis (M). Normally these four 
phases occur sequentially; however, the cell cycle also includes modified cycles wherein 
one or more phases are absent, resulting in a modified cell cycle such as endomitosis, 
acytokinesis, polyploidy, polyteny, and endoreduplication. 

As used herein, the term "plant" includes reference to whole plants, plant organs 
(e.g., leaves, stems, roots), plant tissue, seeds, and plant cells and progeny thereof. Plant 
cell, as used herein, includes, without limitation, seeds, e.g., seed suspension cultures, 
embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, 
sporophytes, pollen, and microspores. The class of plants which can be used in the 
methods of the invention is generally as broad as the class of higher plants amenable to 
transformation techniques, including both monocotyledonous and dicotyledonous plants. 
Particularly preferred plants are Arabidopsis thaliana, rice, wheat, maize, tomato, alfalfa, 
oilseed rape, soybean, cotton, sunflower or canola. The term plant also includes 
monocotyledonous (monocot) plants and dicotyledonous (dicot) plants including a fodder 
or forage legume, ornamental plant, food crop, tree, or shrub selected from the list 

comprising Acacia spp., Acer spp., Actinidia spp., Aesculus spp., Agathis australis, Albizia 
amara, Alsophila tricolor, Andropogon spp., Arachis spp., Areca catechu, Astelia fragrans, 
Astragalus cicer, Baikiaea plurijuga, Betula spp., Brassica spp., Bruguiera gymnorrhiza, 
Burkea africana, Butea frondosa, Cadaba farinosa, Calliandra spp., Camellia sinensis, 
Canna indica, Capsicum spp., Cassia spp., Centroema pubescens, Chaenomeles 
spp., Cinnamomum cassia, Coffea arabica, Colophospermum mopane, Coronillia varia, 
Cotoneaster serotina, Crataegus spp., Cucumis spp., Cupressus spp., Cyathea dealbata, 
Cydonia oblonga, Cryptomeria japonica, Cymbopogon spp., Cynthea dealbata, Cydonia 
oblonga, Dalbergia monetaria, Davallia divaricata, Desmodium spp., Dicksonia squarosa, 
Diheteropogon amplectens, Dioclea spp., Dolichos spp., Dorycnium rectum, Echinochloa 
pyramidalis, Ehrartia spp., Eleusine coracana, Eragrestis spp., Erythrina spp., Eucalyptus 
spp., Euclea schimperi, Eulalia villosa, Fagopyrum spp., Feijoa sellowiana, Fragaria spp., 
Flemingia spp., Freycinetia banksii, Geranium thunbergii, Ginkgo biloba, Glycine 
javanica, Gliricidia spp., Gossypium hirsutum, Grevillea spp., Guibourtia coleosperma, 
Hedysarum spp., Hemarthia altissima, Heteropogon contortus, Hordeum vulgare, 
Hyparrhenia rufa, Hypericum erectum, Hyperthelia dissoluta, Indigo incarnata, Iris spp., 
Leptarrhena pyrolifolia, Lespediza spp., Lettuca spp., Leucaena leucocephala, Loudetia 
simplex, Lotonus bainesii, Lotus spp., Macrotyloma axillare, Malus spp., Manihot 
esculenta, Medicago sativa, Metasequoia glyptostroboides, Musa sapientum, Nicotianum 
spp., Onobrychis spp., Ornithopus spp., Oryza spp., Peltophorum africanum, Pennisetum 
spp., Persea gratissima, Petunia spp., Phaseolus spp., Phoenix canariensis, Phormium 
cookianum, Photinia spp., Picea glauca, Pinus spp., Pisum sativum, Podocarpus totara, 
Pogonarthria fleckii, Pogonarthria squarrosa, Populus spp., Prosopis cineraria, 
Pseudotsuga menziesii, Pterolobium stellatum, Pyrus communis, Quercus spp., 
Rhaphiolepsis umbellata, Rhopalostylis sapida, Rhus natalensis, Ribes grossularia, Ribes 
spp., Robinia pseudoacacia, Rosa spp., Rubus spp., Salix spp., Schyzachyrium 
sanguineum, Sciadopitys verticillata, Sequoia sempervirens, Sequoiadendron giganteum, 
Sorghum bicolor, Spinacia spp., Sporobolus fimbriatus, Stiburus alopecuroides, 
Stylosanthos humilis, Tadehagi spp., Taxodium distichum, Themeda triandra, Trifolium 
spp., Triticum spp., Tsuga heterophylla, Vaccinium spp., Vicia spp., Vitis vinifera, Watsonia 
pyramidata, Zantedeschia aethiopica, Zea mays, amaranth, artichoke, asparagus, broccoli, 
brussel sprout, cabbage, canola, carrot, cauliflower, celery, collard greens, flax, kale, lentil, 
oilseed rape, okra, onion, potato, rice, soybean, straw, sugarbeet, sugar cane, sunflower, 
tomato, squash, and tea, amongst others, or the seeds of any plant specifically named 
above or a tissue, cell or organ culture of any of the above species. 

The cell cycle proteins of the present invention are involved in cell cycle regulation 
which is largely, but not completely, similar in plants and animals. Accordingly, the 
nucleic acid molecules and polypeptides of the invention, or derivatives thereof, may be 
used to modulate the cell cycle in a plant or an animal such as by modulating the activity 
or level or expression of CCP, altering the rate of the cell cycle or phases of the cell cycle, 
and entry into and out of the various cell cycle phases. In plants, the molecules of the 
present invention may be used in agriculture to, for example, improve the growth 
characteristics of a plant such as growth rate or size of specific tissues or organs, the 
architecture or morphology of the plant, increase crop yield, improve tolerance to 
environmental stress conditions (such as drought, salt, temperature, or nutrient 
deprivation), improve tolerance to plant pathogens that abuse the cell cycle or as targets to 
facilitate the identification of inhibitors or activators of CCPs that may be useful as 
phytopharmaceuticals such as herbicides or plant growth regulators. 

As used herein, the term "cell cycle associated disorders" includes a disorder, 
disease or condition which is caused or characterized by a misregulation (e.g., 
downregulation or upregulation), abuse, arrest, or modification of the cell cycle. In plants, 
cell cycle associated disorders include endomitosis, acytokinesis, polyploidy, polyteny, 
and endoreduplication which may be caused by external factors such as pathogens 
(nematodes, viruses, fungi, or insects), chemicals, environmental stress (e.g., drought, 
temperature, nutrients, or UV) resulting in for instance neoplastic tissue (e.g., galls, root 
knots) or inhibition of cell division/proliferation (e.g., stunted growth). Cell cycle 
associated disorders in animals include proliferative disorders or differentiative disorders, 
such as cancer, e.g., melanoma, prostate cancer, cervical cancer, breast cancer, colon 
cancer, or sarcoma. 

The present invention is based, at least in part, on the discovery of novel molecules, 
referred to herein as CCP protein and nucleic acid molecules, which comprise a family of 
molecules having certain conserved structural and functional features. The term "family" 
when referring to the protein and nucleic acid molecules of the invention is intended to 
mean two or more proteins or nucleic acid molecules having a common structural domain 
or motif and having sufficient amino acid or nucleotide sequence homology as defined 
herein. Such family members can be naturally or non-naturally occurring and can be from 
either the same or different species. For example, a family can contain a first protein of 
plant, e.g., Arabidopsis, origin, as well as other, distinct proteins of plant, e.g., Arabidopsis, 
origin or alternatively, can contain homologues of other plants, e.g., rice, or of non-plant 
origin. Members of a family may also have common functional characteristics. 

In one embodiment of the invention, a CCP protein of the present invention is 
identified based on the presence of one or more of the following domains: 

A. Cyclin destruction box 

As used herein, the term "Cyclin destruction box" includes a domain of 9-10 amino
acid residues in length which typically contains the following consensus pattern:
R - X_2 - L - X_2 - [I/V] - X_{n-m} - N (SEQ ID NO:267),
wherein X can be any amino acid, X_n is a stretch of n Xs, X_{n-m} is a stretch of n to m Xs,
and wherein [I/V] means that an Ile or Val residue can occur at that position. SEQ ID
NO:267 depicts the minimal consensus sequence of the cyclin destruction box and




underlies the ubiquitin-mediated proteolytic destruction of the cyclins bearing this motif
(Yamano et al. (1998) EMBO J. 17: 5670-5678; Renaudin et al. (1998) in Plant Cell
Division (Francis, Dudits and Inze, eds.), Portland Press Research Monograph, Portland
Press Ltd. London (1998), pp 67-98).

B. Cyclin box motif 1 

As used herein, the term "Cyclin box motif 1" includes a domain of 8 amino acid
residues in length which typically contains the following consensus pattern:

MRXIL[I/V]DW (SEQ ID NO:268),
wherein X can be any amino acid and wherein [I/V] means that an Ile or Val residue can
occur at that position. This motif forms part of the helix H1 of the first cyclin fold and is
the best conserved motif in the cyclin A/B family (Renaudin et al. (1998) in Plant Cell
Division (Francis, Dudits and Inze, eds.), Portland Press Research Monograph, Portland
Press Ltd. London (1998), pp 67-98).

C. Cyclin box motif 2 

As used herein, the term "Cyclin box motif 2" includes a domain of 8 amino acid
residues in length which typically contains the following consensus pattern:

KYEE - X_3 - P (SEQ ID NO:269),
wherein X can be any amino acid and wherein X_n is a stretch of n Xs. This motif forms
part of the helix H3 of the first cyclin fold wherein the 2 acidic residues are part of the
CDK binding site (Renaudin et al. (1998) in Plant Cell Division (Francis, Dudits and Inze,
eds.), Portland Press Research Monograph, Portland Press Ltd. London (1998), pp 67-98). 
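
Consensus patterns of the kind defined above map naturally onto regular expressions. The following Python sketch is illustrative only and is not part of the disclosed methods. The helper name `consensus_to_regex` and the test sequence are hypothetical, the subscript is assumed to be written inline (e.g., X3 or X_3), and ranged stretches (n to m Xs) or gap alternatives are deliberately not handled.

```python
import re

def consensus_to_regex(pattern):
    """Translate a simple consensus pattern into a regular expression.

    Supported tokens (an assumption of this sketch):
      - a single residue letter, e.g. K
      - X for any amino acid, optionally followed by a repeat count (X3 or X_3)
      - a bracketed alternative such as [I/V]
    """
    # Drop spaces, hyphens and underscores used only for layout.
    compact = re.sub(r"[\s_-]", "", pattern)
    parts = []
    for token in re.findall(r"\[[A-Z/]+\]|X\d*|[A-Z]", compact):
        if token.startswith("["):
            # [I/V] becomes the character class [IV]
            parts.append("[" + token[1:-1].replace("/", "") + "]")
        elif token.startswith("X"):
            count = token[1:]
            parts.append(f".{{{count}}}" if count else ".")
        else:
            parts.append(token)
    return "".join(parts)

# Hypothetical sequence used only to exercise the helper.
toy_sequence = "AAKYEEIYPPAA"
match = re.search(consensus_to_regex("KYEE - X_3 - P"), toy_sequence)
```

Here `match.start()` gives the 0-based offset of the motif in the toy sequence; a real screen would iterate over full-length protein sequences instead.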

D. CDC2 motifs

As used herein, the term "CDC2 motifs" includes domains of about 9-12 amino
acid residues in length which typically contain one of the following consensus
patterns:

GXG - X_2 - GXVY (SEQ ID NO:270)
HRDXK - X_2 - NXL (SEQ ID NO:271)
D - X_{n-m} - [W/Y]SXG - X_4 - E (SEQ ID NO:272)
wherein X can be any amino acid, X_n is a stretch of n Xs, X_{n-m} is a stretch of n to
m Xs, and wherein [W/Y] means that a Trp or Tyr residue can occur at that position.

E. CDK phosphorylation site

As used herein, the term "CDK phosphorylation site" includes a domain of about 5-7
amino acids in length which contains one or more of the following consensus
domains:

TPX_{n-m}[R/K] (SEQ ID NO:273)
SPX[R/K] (SEQ ID NO:274)
SPX(Hu) (SEQ ID NO:275)
SP(Hu)X (SEQ ID NO:276)

with Hu being a hydrophobic uncharged amino acid (M, I, L, V) and X any amino acid.
The foregoing are typically found in cyclin-dependent kinase substrates such as histone
kinase, transcription factors such as E2F or transcription regulators like Rb. CDK
phosphorylation sites are described in, for example, Tamrakar et al. (2000) Frontiers Biosci.
5, d121-137.

CCP proteins of the present invention comprising a CDK phosphorylation site can
be mutated in said CDK phosphorylation site such that said CCP proteins are no longer
able to be phosphorylated on the CDK phosphorylation site. Mutations of a CDK
phosphorylation site include all mutations of the Ser or Thr residue in any of SEQ ID
NOs:273-276 into a non-phosphorylatable amino acid residue, e.g., an Ala or Glu residue.
Mutation of one or more CDK phosphorylation site(s) in a CCP protein of the invention is
expected to modulate modifications of the CCP protein by CDKs and, thus, to modulate
the biological or biochemical function of the CCP protein. 
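
As an illustration of the site definitions and the mutation strategy described above, the following Python sketch locates the three SP-type consensus sites (SEQ ID NOs:274-276) in a protein sequence and replaces the phosphorylatable Ser with Ala. It is a minimal sketch under stated assumptions: the function names and the test sequence are hypothetical, the TP-type pattern is omitted, and overlapping sites are reported once per matching pattern.

```python
import re

# Hydrophobic uncharged residues ("Hu") as listed in the text: M, I, L, V.
HU = "MILV"

# Regular-expression forms of the SP-type consensus patterns.
CDK_SITE_PATTERNS = {
    "SPX[R/K]": r"SP.[RK]",
    "SPX(Hu)": rf"SP.[{HU}]",
    "SP(Hu)X": rf"SP[{HU}].",
}

def find_cdk_sites(protein):
    """Return sorted (position, pattern) pairs; position is the 1-based index of the Ser."""
    hits = []
    for name, regex in CDK_SITE_PATTERNS.items():
        # A lookahead lets overlapping sites be reported as well.
        for m in re.finditer(rf"(?={regex})", protein):
            hits.append((m.start() + 1, name))
    return sorted(hits)

def mutate_cdk_sites(protein, replacement="A"):
    """Replace the Ser of every detected site with a non-phosphorylatable residue."""
    residues = list(protein)
    for position, _ in find_cdk_sites(protein):
        residues[position - 1] = replacement
    return "".join(residues)

# Hypothetical sequence, not taken from the sequence listing.
toy_protein = "MASPEKAAASPLGTT"
sites = find_cdk_sites(toy_protein)
mutant = mutate_cdk_sites(toy_protein)
```

Passing `replacement="E"` would give the Glu substitution mentioned in the text instead of the Ala substitution.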

F. Nuclear localisation signal (NLS)

As used herein, the term "nuclear localization signal" or "NLS" includes a domain
conferring to a protein comprising the NLS domain the ability to be imported into the
nucleus and to, for example, accumulate within the nucleus. NLS domains include one or
more of the following consensus patterns:

PKKKRKV (SEQ ID NO:277)
KRX_10KKKK (SEQ ID NO:278)
KRPRP (SEQ ID NO:279)
PAAKRVKLD (SEQ ID NO:280)

NLS domains have been found in the SV40 T antigen, in nucleoplasmin (bipartite
NLS), in Adeno E1A, and in c-Myc. NLS domains are described in, for example, Laskey
et al. (1998) Biochem. Soc. Trans. 26, 561-567.

G. Cy-like boxes

As used herein, the term "Cy-like box" includes a domain of 3-6 amino acid
residues in length which has the consensus motif R-X-X-F (SEQ ID NO:281), with X being
any amino acid and one of the two Xs preferably being a hydrophobic residue.

H. Rb binding domain 

As used herein, the term "Rb binding domain" includes a domain which, when
present in a protein, confers to the protein the ability to bind the Rb protein. Rb binding
domains include one or more of the following consensus patterns:

LXCXE (SEQ ID NO:282)
LXSXE (SEQ ID NO:283)
DYX_7EX_3DLFD (SEQ ID NO:284)
DYX_6DX_4DMWE (SEQ ID NO:285)

Rb binding domains have been found in D-cyclins, in protein phosphatase 1, in
human E2F-1, and in plant E2F. Rb binding domains are described in, for example, Rubin
et al. (1998) Frontiers Biosci. 3, d1209-1219; Phelps et al. (1992) J. Virol. 66, 2418-2427;
and Cress et al. (1993) Mol. Cell. Biol. 13, 6314-6325.

I. DEF Domain 

As used herein, the term "DEF domain" includes a protein domain which is required
for the formation of heterodimers between DP proteins and E2F proteins. DEF domains
comprise the following consensus pattern:

[D/N/-][Q/E]KMR[R/G]RV[T/D]DA
[V/I]I[S/A][K/R]P^ 

(SEQ ID NO:286) 



J. DNA Binding Domain

As used herein, the term "DNA binding domain" includes a domain which is
involved in the binding of DP proteins and/or DP-E2F heterodimers to DNA. DNA
binding domains include the following consensus pattern:

[G/N][K/R]GLR[H/Q]FS[

[R/I/-]TT[S/-]Y[N^ 

(SEQ ID NO:287) 

DNA binding domains are described in, for example, Hao et al. (1995) J. Cell Sci. 108,
2945-2954; Bandara et al. (1993) EMBO J. 12, 4317-4324; and Girling et al. (1994) Mol.
Biol. Cell 5, 1081-1092.

K. DCB1 Domain:

As used herein, the term "DCB1 domain" includes a protein domain which is
conserved among DP proteins and has the following consensus patterns:

[R/S][I/V]X[Q/K]KX_3[L/S]XE
(SEQ ID NO:288)

[R/S][I/V]X[Q/K]KX_3[L/S]XE[L/M]X_2^

(SEQ ID NO:289)

DCB1 domains are described in, for example, Hao et al. (1995) J. Cell Sci. 108, 2945-2954;
Bandara et al. (1993) EMBO J. 12, 4317-4324; and Girling et al. (1994) Mol. Biol.
Cell 5, 1081-1092.

L. DCB2 Domain: 

As used herein, the term "DCB2 domain" includes a protein domain which is
conserved among DP proteins and has the following consensus pattern:

[L/I]PFI|IVI] ly/LJXTX^f^^ 

(SEQ ID NO:290)

DCB2 domains are described in, for example, Hao et al. (1995) J. Cell Sci. 108, 2945-2954;
Bandara et al. (1993) EMBO J. 12, 4317-4324; and Girling et al. (1994) Mol. Biol.
Cell 5, 1081-1092.

M. SAP Domain: 

As used herein, the term "SAP motif" includes a protein domain of about 35 amino
acid residues which is found in a variety of nuclear proteins involved in transcription,
DNA repair, DNA processing or apoptotic chromatin degradation. It was named after SAF-A/B,
Acinus and PIAS, three proteins known to contain it. The SAP motif reveals a
bipartite distribution of strongly conserved hydrophobic, polar and bulky amino acids
separated by a region that contains a glycine. The SAP domain has been proposed to be a
DNA-binding motif (Aravind and Koonin (2000) Trends Biochem. Sci. 25:112-114).




Isolated CCP proteins of the present invention have an amino acid sequence
sufficiently identical to the amino acid sequence of SEQ ID NO:67-132, 205, 211, 215-216,
or 220-227 or are encoded by a nucleotide sequence sufficiently identical to SEQ ID
NO:1-66 or 228-239. As used herein, the term "sufficiently identical" refers to a first
amino acid or nucleotide sequence which contains a sufficient or minimum number of
identical or equivalent (e.g., an amino acid residue which has a similar side chain) amino
acid residues or nucleotides to a second amino acid or nucleotide sequence such that the
first and second amino acid or nucleotide sequences share common structural domains or
motifs and/or a common functional activity. For example, amino acid or nucleotide
sequences which share common structural domains, have at least 30%, 40%, or 50%
homology, preferably 60% homology, more preferably 70%-80%, and even more
preferably 90-95% homology across the amino acid sequences of the domains, and contain
at least one and preferably two structural domains or motifs, are defined herein as
sufficiently identical. Furthermore, amino acid or nucleotide sequences which share at
least 30%, 40%, or 50%, preferably 60%, more preferably 70-80%, or 90-95% homology
and share a common functional activity are defined herein as sufficiently identical. 
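
To make the percentage thresholds above concrete, the following Python sketch computes percent identity over two sequences that have already been aligned to equal length. This is only one possible reading of the homology figures: the function name, the gap symbol "-" and the choice to skip gapped columns are assumptions of this sketch, and the alignment step itself is not shown.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # gapped columns are skipped in this sketch
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0

# Two hypothetical 8-residue motif instances differing at two positions.
identity = percent_identity("MRAILVDW", "MRSILIDW")
```

For the two toy motif instances, six of eight positions agree, so the pair would meet a 70% threshold but not a 90% one.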

As used interchangeably herein, a "CCP activity", "biological activity of CCP" or
"functional activity of CCP" refers to an activity exerted by a CCP protein, polypeptide or
nucleic acid molecule on a CCP responsive cell or tissue, or on a CCP protein substrate, as
determined in vivo or in vitro, according to standard techniques. In one embodiment, a
CCP activity is a direct activity, such as an association with a CCP-target molecule. As
used herein, a "target molecule" or "binding partner" is a molecule with which a CCP
protein binds or interacts in nature, such that CCP-mediated function is achieved. A CCP
target molecule can be a non-CCP molecule or a CCP protein or polypeptide of the present
invention, e.g., a plant cyclin dependent kinase, such as CDC2b. In an exemplary
embodiment, a CCP target molecule is a CCP ligand. Alternatively, a CCP activity is an
indirect activity, such as a cellular signaling activity mediated by interaction of the CCP
protein with a CCP ligand. The biological activities of CCP are described herein. For
example, the CCP proteins of the present invention can have one or more of the following
activities: (1) they may interact with a non-CCP protein molecule, e.g., a CCP ligand; (2)
they may modulate a CCP-dependent signal transduction pathway; (3) they may modulate
the activity of a plant cyclin dependent kinase, such as CDC2a, CDC2b, or CDC2c; and (4)
they may modulate the cell cycle.

Accordingly, another embodiment of the invention features isolated CCP proteins
and polypeptides having a CCP activity. Preferred proteins are CCP proteins having at
least one of the following domains: a "cyclin destruction box", a "cyclin box motif
1", a "cyclin box motif 2", a "CDC2 motif", a "CDK phosphorylation site", a "nuclear
localization signal", a "Cy-like box", an "Rb binding domain", a "DEF domain", a "DNA
binding domain", a "DCB1 domain", a "DCB2 domain" and/or a "SAP domain", and,
preferably, a CCP activity.

Additional preferred proteins have at least one of the following domains: a
"cyclin destruction box", a "cyclin box motif 1", a "cyclin box motif 2", a "CDC2 motif",
a "CDK phosphorylation site", a "nuclear localization signal", a "Cy-like box", an "Rb
binding domain", a "DEF domain", a "DNA binding domain", a "DCB1 domain", a
"DCB2 domain" and/or a "SAP domain" and are, preferably, encoded by a nucleic acid
molecule having a nucleotide sequence which hybridizes under stringent hybridization
conditions to a nucleic acid molecule comprising the nucleotide sequence of SEQ ID
NO:1-66 or 228-239.

The sequences of the present invention are summarized below, in Table I. 



TABLE I:

CCP Molecule | Clone Name | Bait | Homolog/function | Motif | SEQ ID NO: partial DNA | SEQ ID NO: full-length DNA | SEQ ID NO: partial Protein | SEQ ID NO: full-length Protein
CCP1 | CDC2bDN-IC19 | CDC2bAt.N161 | Novel CYCB2;3 | cyclin box motifs 1 and 2; cyclin destruction box | 1 | 39 | 67 | 105
CCP2 | CDC2bDN-IC20 | CDC2bAt.N161 | ARR2 | | 2 | 40 | 68 | 106
CCP3 | CDC2bDN-IC21 | CDC2bAt.N161 | novel A-type cyclin | cyclin box motifs 1 and 2; cyclin destruction box | 3 | 41 | 69 | 107
CCP4 | CDC2bDN-IC26M | CDC2bAt.N161 | | CDK phosphorylation site | 4 | 4 | 70 | 70
CCP5 | CDC2bDN-IC39 | CDC2bAt.N161 | ArathCYCB2;1 | cyclin box motifs 1 and 2; cyclin destruction box | 5 | 5 | 71 | 71
CCP6 | CDC2bDN-IC57 | CDC2bAt.N161 | | | 6 | 42 | 72 | 108
CCP7 | CDC2bDN-IC62 | CDC2bAt.N161 | AJH2-COP9 | | 7 | 43 | 73 | 109
CCP8 | E2F3ca55 | E2F3 N-terminal | | | 8 | 43 | 74 | 109
CCP9 | CDC2bDN-IC9 | CDC2bAt.N161 | Arath CYCA2;2 | cyclin box motifs 1 and 2; cyclin destruction box | 9 | 9 | 75 | 75
CCP10 | CKSBC001 | CKS1At | | | 10 | 10 | 76 | 76
CCP11 | CKSBC011 | CKS1At | gibberellin-regulated protein GASA1 precursor | | 11 | 44 | 77 | 110
CCP12 | CKSBC98-7 (Cterm) | CKS1At | | | | 45 | 78 | 111
CCP13 | CKSBC98-7 (Nterm) | CKS1At | | | 13 | 45 | 79 | 111
CCP14 | CKSBC103-19 (Cterm) | CKS1At | | | 14 | 46 | 80 | 112
CCP15 | CKSBC199-20 | CKS1At | PSTTLRE-type CDK | CDC2 motifs | 15 | 47 | 81 | 113
CCP16 | E2F5BBC1 | E2F5 dimerisation domain | DPa | DNA-binding domain; DEF domain; DCB1 and DCB2 domain | 16 | 48 | 82 | 114
CCP17 | FL67BC4-2 | CKI4 | | | 17 | 17 | 83 | 83
CCP18 | FL67BC12-17 | CKI4 | RNA polymerase B transcription factor 3 | | 18 | 49 | 84 | 115
CCP19 | JUT1 | PLP1 | | | 19 | 19 | 85 | 85
CCP20 | JUT2 | PLP1 | | | 20 | 50 | 86 | 116
CCP21 | JUT3 | PLP1 | | | 21 | 50 | 87 | 116
CCP22 | JUT6 | PLP1 | Submergence induced protein 2 of Oryza sativa | | 22 | 51 | 88 | 117
CCP23 | kbp1 | KLPNT1 36-508 aa (motor domain); KLPNT2 (TH65) 73-186 aa (neck domain) | HSF1 | | 23 | 52 | 89 | 118
CCP24 | kbp3 | KLPNT1 (427-867 aa) stalk domain | | | 24 | 53 | 90 | 119
CCP25 | kbp6 | KLPNT2 (TH65) 73-186 aa neck domain | | | 25 | 54 | 91 | 120
CCP26 | kbp9 | KLPNT2 (TH65) 73-186 aa neck domain | AtKLPNT1 | | 26 | 55 | 92 | 121
CCP27 | kbp11 | KLPNT2 (TH65) 73-186 aa neck domain | | | 27 | 56 | 93 | 122
CCP28 | kbp12 | KLPNT2 (TH65) 73-186 aa neck domain | | | 28 | 57 | 94 | 123
CCP29 | kbp13 | KLPNT2 (TH65) 73-186 aa neck domain | | | 29 | 29 | 95 | 95
CCP30 | kbp15 | KLPNT2 (TH65) 73-186 aa neck domain | Centromere/microtubule binding protein CBF5 from yeast | | 30 | 58 | 96 | 124
CCP31 | | KLPNT2 (TH65) 73-608 aa stalk | VU91C calmodulin from yeast | | 31 | 59 | 97 |
CCP32 | E2F5BBC16 | E2F5 dimerisation | | | 32 | 60 | 98 | 126
CCP33 | DPb | / | | DNA-binding domain; DEF domain; DCB1 and DCB2 domain | 33 | 61 | 99 | 127
CCP34 | E2F3ca1 | E2F3 N-terminal | | | 34 | 62 | 100 | 128
CCP35 | E2F3ca2 | E2F3 N-terminal | | | 35 | 63 | 101 | 129
CCP36 | E2F3ca9 | E2F3 N-terminal | | | 36 | 64 | 102 | 130
CCP37 | E2F3ca12 | E2F3 N-terminal | | SAP domain | 37 | 65 | 103 | 131
CCP38 | E2F3ca50 | E2F3 N-terminal | | | 38 | 66 | 104 | 132



Detailed studies of interactions between AtDPs (a and b forms, SEQ ID NO:114
and SEQ ID NO:127, respectively) and AtE2Fs (a and b forms; GenBank accession
numbers AJ294534 and AJ294533, respectively) revealed that the regions of AtDPa and
AtDPb involved in the binding of AtE2Fb are different.

Binding of AtDPa to AtE2Fb requires at least the AtDPa dimerization domain and
the whole (or possibly part of) the C-terminal domain of AtDPa. The N-terminal domain
and the DNA-binding domain of AtDPa do not seem to contribute to the interaction of
AtDPa with AtE2Fb (Examples 11, 12, Table 5, Figure 54).

Binding of AtDPb to AtE2Fb, however, only requires an intact AtDPb dimerization
domain. Neither the region including the N-terminal and DNA-binding domains of
AtDPb, nor the C-terminal region of AtDPb seems to contribute to the interaction of AtDPb
with AtE2Fb (Examples 11, 12, Table 5, Figure 55). These observations indicate that
modulating the formation of specific E2F/DP-complexes may be useful in modulating cell
cycle traversal and the regulation thereof.

AtDPa and AtDPb, respectively, do not form homodimers but both interact with
either AtE2Fa or AtE2Fb (Example 12, Table 5). In reciprocal experiments it was shown
that the N-terminal domain of AtE2Fa is not required for binding AtDPa or AtDPb.
Likewise, the Rb-binding domains of AtE2Fa and AtE2Fb, respectively, do not seem to
contribute to the binding to either AtDPa or AtDPb. The region of AtE2Fa encompassing
the dimerization domain and the marked box is sufficient for binding to AtDPa and AtDPb
(Examples 11, 12, Fig. 50, Fig. 51, Table 5). The dimerization domain of AtE2Fs appears
to be sufficient for binding to AtDPs.

Accordingly, it is shown herein for the first time (for plant DPs and plant E2Fs)
that the minimal DP and E2F proteins or corresponding coding DNA sequences that can be
used in modifying E2F/DP-related processes, e.g., regulation of gene expression by
E2F/DP, include:

(A) Plant DP dimerization domain with or without (part of) the C-terminal DP
domain. These domains include the proteins AtDPa143-292 and AtDPa143-213
(numbering indicates the amino acids included in said fragment relative to the full-length
AtDPa protein) set forth in SEQ ID NO:221 and SEQ ID NO:222, respectively. The
coding sequences corresponding to the foregoing amino acid sequences are set forth in
SEQ ID NO:232 and SEQ ID NO:233, respectively. Also included are the corresponding
regions of the AtDPb protein characterized by AtDPb182-385 and AtDPb182-263 (parts of
the full-length AtDPb protein). The foregoing regions of AtDPb are set forth in SEQ ID
NO:216 and SEQ ID NO:215, respectively, and the coding sequences corresponding
thereto are set forth in SEQ ID NO:231 and SEQ ID NO:230, respectively. The AtDPb1-263
domain (SEQ ID NO:223) and the corresponding AtDPa1-214 domain (SEQ ID
NO:220) encoded by the nucleic acid sequences SEQ ID NO:234 and SEQ ID NO:239,
respectively, can also be used. Further included are nucleic acid sequences hybridizing to
SEQ ID NOs:229-234 or SEQ ID NO:239 or encoding a protein at least 50%, 55%, 60%,
65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or more identical to SEQ ID NOs:211, 215-216
and 220-223.

(B) Plant E2F dimerization domain with or without (part of) the marked box.
These domains include the proteins AtE2Fa232-282, AtE2Fa232-352 and AtE2Fa226-356
set forth in SEQ ID NO:224, SEQ ID NO:225 and SEQ ID NO:205, respectively. The
corresponding coding DNA sequences are set forth in SEQ ID NO:235, SEQ ID NO:236
and SEQ ID NO:228, respectively. Also included are the corresponding regions of the
AtE2Fb protein characterized by AtE2Fb194-243 and AtE2Fb194-311 set forth in SEQ ID

NO:226 and SEQ ID NO:227, respectively. The corresponding coding DNA sequences are
set forth in SEQ ID NO:237 and SEQ ID NO:238, respectively. Further included are
nucleic acid sequences hybridizing to SEQ ID NO:228 or SEQ ID NOs:235-238 or
encoding a protein at least 70%, 75%, 80%, 85%, 90%, 95%, 98% identical to SEQ ID
NO:205 or SEQ ID NOs:224-227.

(C) Full-length plant DP and plant E2F proteins or corresponding DNA sequences
may also be used to modify said E2F/DP-related processes. Furthermore, plant DP and
plant E2F proteins or corresponding DNA sequences, or parts thereof, can be used either
separately or in combination to modify said E2F/DP-related processes. This is underscored
by the demonstration that AtDPs and AtE2Fs are co-expressed in actively dividing cells
and in at least some plant tissues (Example 13 and Figures 57 and 58). 
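
The fragment names used in (A) and (B), such as AtDPa143-292, give the first and last residue of the fragment relative to the full-length protein, so both ends are included. The Python sketch below shows that convention; the helper name `fragment` and the placeholder sequence are hypothetical, and no sequence from the sequence listing is reproduced.

```python
def fragment(full_length, first, last):
    """Return residues first..last (1-based, both ends included) of a protein."""
    if not 1 <= first <= last <= len(full_length):
        raise ValueError("fragment boundaries fall outside the protein")
    return full_length[first - 1:last]

# Placeholder stand-in for a full-length protein of 300 residues.
placeholder = "A" * 300
piece = fragment(placeholder, 143, 292)
```

Under this convention a fragment named 143-292 spans 292 - 143 + 1 = 150 residues.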

Various aspects of the invention are described in further detail in the following
subsections:

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that encode
CCP proteins or biologically active portions thereof, as well as nucleic acid fragments
sufficient for use as hybridization probes to identify CCP-encoding nucleic acids (e.g.,
CCP mRNA) and fragments for use as PCR primers for the amplification or mutation of
CCP nucleic acid molecules. As used herein, the term "nucleic acid molecule" is intended
to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g.,
mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The nucleic
acid molecule can be single-stranded or double-stranded, but preferably is double-stranded
DNA.

An "isolated" nucleic acid molecule is one which is separated from other nucleic
acid molecules which are present in the natural source of the nucleic acid. For example,
with regard to genomic DNA, the term "isolated" includes nucleic acid molecules which
are separated from the chromosome with which the genomic DNA is naturally associated.
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic
acid (i.e., sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA
of the organism from which the nucleic acid is derived. For example, in various
embodiments, the isolated CCP nucleic acid molecule can contain less than about 5 kb,
4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the
nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived.
Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be
substantially free of other cellular material, or culture medium when produced by
recombinant techniques, or substantially free of chemical precursors or other chemicals
when chemically synthesized.

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule
having the nucleotide sequence of SEQ ID NO:1-66 or 228-239, or a portion thereof, can
be isolated using standard molecular biology techniques and the sequence information
provided herein. For example, using all or a portion of the nucleic acid sequence of SEQ ID
NO:1-66 or 228-239 as a hybridization probe, CCP nucleic acid molecules can be isolated
using standard hybridization and cloning techniques (e.g., as described in Sambrook, J.,
Fritsch, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual, 2nd ed., Cold
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
NY, 1989).

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID NO:1-66
or 228-239 can be isolated by the polymerase chain reaction (PCR) using synthetic
oligonucleotide primers designed based upon the sequence of SEQ ID NO:1-66 or
228-239, respectively.

A nucleic acid of the invention can be amplified using cDNA, mRNA or,
alternatively, genomic DNA as a template and appropriate oligonucleotide primers
according to standard PCR amplification techniques. The nucleic acid so amplified can be
cloned into an appropriate vector and characterized by DNA sequence analysis.
Furthermore, oligonucleotides corresponding to CCP nucleotide sequences can be prepared
by standard synthetic techniques, e.g., using an automated DNA synthesizer.

In a preferred embodiment, an isolated nucleic acid molecule of the invention
comprises the nucleotide sequence shown in SEQ ID NO:1-66 or 228-239.

In another preferred embodiment, an isolated nucleic acid molecule of the invention
comprises a nucleic acid molecule which is a complement of the nucleotide sequence
shown in SEQ ID NO:1-66 or 228-239, or a portion of any of these nucleotide sequences.
A nucleic acid molecule which is complementary to the nucleotide sequence shown in
SEQ ID NO:1-66 or 228-239 is one which is sufficiently complementary to the nucleotide
sequence shown in SEQ ID NO:1-66 or 228-239, respectively, such that it can hybridize to
the nucleotide sequence shown in SEQ ID NO:1-66 or 228-239, respectively, thereby
forming a stable duplex. 
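
For a fully complementary strand, the complement referred to above follows directly from Watson-Crick base pairing. The Python sketch below derives the reverse complement of a DNA sequence written 5' to 3'; the function name and the six-base example are hypothetical, and partially complementary sequences that still form a stable duplex are outside the scope of this sketch.

```python
# Watson-Crick pairing for unambiguous DNA bases; N is left as N.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")

def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence, both written 5' to 3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]

# Hypothetical six-base example.
antiparallel = reverse_complement("ATGGCC")
```

Applying the function twice returns the original sequence, which is a quick sanity check for any pairing table.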

In still another preferred embodiment, an isolated nucleic acid molecule of the
present invention comprises a nucleotide sequence which is at least about 50%, 55%, 60%,
65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or more homologous to the nucleotide
sequence (e.g., to the entire length of the nucleotide sequence) shown in SEQ ID NO:1-66
or 228-239, or a portion of any of these nucleotide sequences.

Moreover, the nucleic acid molecule of the invention can comprise only a portion
of the nucleic acid sequence of SEQ ID NO:1-66 or 228-239, for example a fragment
which can be used as a probe or primer or a fragment encoding a biologically active
portion of a CCP protein. The nucleotide sequence determined from the cloning of the
CCP gene allows for the generation of probes and primers designed for use in identifying
and/or cloning other CCP family members, as well as CCP homologues from other
species. The probe/primer typically comprises substantially purified oligonucleotide. The
oligonucleotide typically comprises a region of nucleotide sequence that hybridizes under
stringent conditions to at least about 12 or 15, preferably about 20 or 25, more preferably
about 30, 35, 40, 45, 50, 55, 60, 65, or 75 consecutive nucleotides of a sense sequence of
SEQ ID NO:1-66 or 228-239, or of a naturally occurring allelic variant or mutant of SEQ
ID NO:1-66 or 228-239. In an exemplary embodiment, a nucleic acid molecule of the
present invention comprises a nucleotide sequence which is at least 100, 150, 200, 250,
300, 350, 400, 450, 500, 550, 600, 650, 700, 750, or 800 nucleotides in length and
hybridizes under stringent hybridization conditions to a nucleic acid molecule of SEQ ID
NO:1-66 or 228-239.

Probes based on the CCP nucleotide sequences can be used to detect transcripts or
genomic sequences encoding the same or homologous proteins. In preferred embodiments,
the probe further comprises a label group attached thereto, e.g., the label group can be a
radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes
can be used as a part of a diagnostic test kit for identifying cells or tissues which
misexpress a CCP protein, such as by measuring a level of a CCP-encoding nucleic acid in
a sample of cells from a subject, e.g., detecting CCP mRNA levels or determining whether
a genomic CCP gene has been mutated or deleted.

A nucleic acid fragment encoding a "biologically active portion of a CCP protein"
can be prepared by isolating a portion of the nucleotide sequence of SEQ ID NO:1-66 or
228-239, which encodes a polypeptide having a CCP biological activity (the biological
activities of the CCP proteins are described herein), expressing the encoded portion of the
CCP protein (e.g., by recombinant expression in vitro) and assessing the activity of the
encoded portion of the CCP protein.

The invention further encompasses nucleic acid molecules that differ from the
nucleotide sequence shown in SEQ ID NO:1-66 or 228-239, due to the degeneracy of the
genetic code and, thus, encode the same CCP proteins as those encoded by the nucleotide
sequence shown in SEQ ID NO:1-66 or 228-239. In another embodiment, an isolated
nucleic acid molecule of the invention has a nucleotide sequence encoding a CCP protein. 
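
The degeneracy of the genetic code mentioned above can be illustrated with a short translation routine. The Python sketch below uses the standard genetic code; the function name and the two toy coding sequences are hypothetical and are not taken from the sequence listing.

```python
# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(coding_sequence):
    """Translate a coding sequence (DNA or RNA) codon by codon; '*' marks a stop."""
    seq = coding_sequence.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# Two toy coding sequences that differ at three positions.
variant_1 = "ATGGGCGGAAAA"
variant_2 = "ATGGGTGGTAAG"
same_protein = translate(variant_1) == translate(variant_2)
```

Both toy sequences encode the same four-residue peptide although they differ at the third position of three codons.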

In addition to the CCP nucleotide sequences shown in SEQ ID NO:1-66 or 228-239,
it will be appreciated by those skilled in the art that DNA sequence polymorphisms
that lead to changes in the amino acid sequences of the CCP proteins may exist within a
population (e.g., an Arabidopsis or rice plant population). Such genetic polymorphism in
the CCP genes may exist among individuals within a population due to natural allelic
variation. As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid
molecules which include an open reading frame encoding a CCP protein, preferably a
plant CCP protein, and can further include non-coding regulatory sequences and introns.
Such natural allelic variations include both functional and non-functional CCP proteins
and can typically result in 1-5% variance in the nucleotide sequence of a CCP gene. Any
and all such nucleotide variations and resulting amino acid polymorphisms in CCP genes
that are the result of natural allelic variation and that do not alter the functional activity of a
CCP protein are intended to be within the scope of the invention. Differences in preferred
codon usage are illustrated below for Agrobacterium tumefaciens (a bacterium),
Arabidopsis thaliana, Medicago sativa (two dicotyledonous plants) and Oryza sativa (a
monocotyledonous plant). These examples were extracted from
http://www.kazusa.or.jp/codon. For example, the codon GGC (for glycine) is the most
frequently used codon in A. tumefaciens (36.2 ‰), is the second most frequently used
codon in O. sativa but is used at much lower frequencies in A. thaliana and M. sativa (9 ‰
and 8.4 ‰, respectively). Of the four possible codons encoding glycine the GGC codon is
most preferably used in A. tumefaciens and O. sativa. However, in A. thaliana the GGA
(and GGU) codon is most preferably used, whereas in M. sativa the GGU (and GGA)
codon is most preferably used. 
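
Codon frequencies of the kind quoted above are expressed per thousand codons. The Python sketch below shows how such a figure can be derived from a single coding sequence; the function name and the four-codon input are hypothetical, and published tables are compiled from many genes rather than from one sequence.

```python
from collections import Counter

def codon_usage_per_thousand(coding_sequence):
    """Return the frequency of each codon per thousand codons in one coding sequence."""
    seq = coding_sequence.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)
    total = len(codons)
    return {codon: 1000.0 * n / total for codon, n in counts.items()}

# Toy input with four codons: ATG, GGC, GGC, GGA.
usage = codon_usage_per_thousand("ATGGGCGGCGGA")
```

Comparing such per-thousand values between a source organism and an expression host is one way to decide which synonymous codons to prefer.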

Moreover, nucleic acid molecules encoding other CCP family members and, thus,
which have a nucleotide sequence which differs from the CCP sequences of SEQ ID NO:1-66
or 228-239 are intended to be within the scope of the invention. For example, another
CCP cDNA can be identified based on the nucleotide sequence of the plant CCP molecules
described herein. Moreover, nucleic acid molecules encoding CCP proteins from different
species, and thus which have a nucleotide sequence which differs from the CCP sequences
of SEQ ID NO:1-66 or 228-239 are intended to be within the scope of the invention. For
example, a human CCP cDNA can be identified based on the nucleotide sequence of a 
plant CCP. 

Nucleic acid molecules corresponding to natural allelic variants and homologues of 
the CCP cDNAs of the invention can be isolated based on their homology to the CCP 
nucleic acids disclosed herein using the cDNAs disclosed herein, or a portion thereof, as a 
hybridization probe according to standard hybridization techniques under stringent 
hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 15, 20, 25, 30 or more nucleotides in length and hybridizes under 
stringent conditions to the nucleic acid molecule comprising the nucleotide sequence of 
SEQ ID NO:1-66 or 228-239. In another embodiment, the nucleic acid is at least 30, 50, 100, 
150, 200, 250, 300, 350, 400, 450, 500, 550, or 600 nucleotides in length. As used herein, 
the term "hybridizes under stringent conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide sequences at least 30%, 40%, 50%, or 
60% homologous to each other typically remain hybridized to each other. Preferably, the 
conditions are such that sequences at least about 70%, more preferably at least about 80%, 
even more preferably at least about 85% or 90% homologous to each other typically 
acid. The antisense nucleic acid can be complementary to an entire CCP coding strand, or 
only to a portion thereof. In one embodiment, an antisense nucleic acid molecule is 
antisense to a "coding region" of the coding strand of a nucleotide sequence encoding 
CCP. The term "coding region" refers to the region of the nucleotide sequence comprising 
codons which are translated into amino acid residues. In another embodiment, the 
antisense nucleic acid molecule is antisense to a "noncoding region" of the coding strand of 
a nucleotide sequence encoding CCP. The term "noncoding region" refers to 5' and 3' 
sequences which flank the coding region that are not translated into amino acids (i.e., also 
referred to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding CCP disclosed herein, antisense 
nucleic acids of the invention can be designed according to the rules of Watson and Crick 
base pairing. The antisense nucleic acid molecule can be complementary to the entire 
coding region of CCP mRNA, but more preferably is an oligonucleotide which is antisense 
to only a portion of the coding or noncoding region of CCP mRNA. For example, the 
antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of CCP mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 
15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the 
invention can be constructed using chemical synthesis and enzymatic ligation reactions 
using procedures known in the art. For example, an antisense nucleic acid (e.g., an 
antisense oligonucleotide) can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the biological stability 
of the molecules or to increase the physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. Examples of modified nucleotides which can be used 
to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 
5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 
1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 
3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense 
nucleic acid can be produced biologically using an expression vector into which a nucleic 
acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted 
nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described 
further in the following subsection). Preferably, production of antisense nucleic acids in 
plants occurs by means of a stably integrated transgene comprising a promoter operative in 
plants, an antisense oligonucleotide, and a terminator. 
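
The design rule described above (Watson and Crick base pairing against a portion of the coding strand, for example the region surrounding the translation start site) can be sketched as follows. This is a minimal illustration only; the function names, the `flank` parameter and the toy sequence are assumptions and do not correspond to any sequence of the SEQ ID listing.

```python
# Complement table for DNA; U is accepted on input and paired with A.
COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a nucleic acid sequence as DNA."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def antisense_oligo(coding_strand: str, start_codon_index: int, flank: int = 10) -> str:
    """Return a DNA oligo antisense to the region surrounding the start codon.

    start_codon_index is the 0-based position of the A of the ATG in the
    coding strand; flank is the number of nucleotides taken on each side.
    """
    begin = max(0, start_codon_index - flank)
    end = min(len(coding_strand), start_codon_index + 3 + flank)
    return reverse_complement(coding_strand[begin:end])


# Hypothetical coding strand with the ATG at index 6.
toy = "GCCACCATGGCTTCA"
print(antisense_oligo(toy, toy.find("ATG"), flank=3))  # prints AGCCATGGT
```

Here a window of 9 nucleotides is used for brevity; the lengths of about 5 to 50 nucleotides mentioned above are obtained by varying `flank`.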

Other known nucleotide modifications include methylation, cyclization and 'caps' 
and substitution of one or more of the naturally occurring nucleotides with an analog such 
as inosine. Modifications of nucleotides include modifications generated by the addition to 
nucleotides of acridine, amine, biotin, cascade blue, cholesterol, Cy3®, Cy5®, Cy5.5®, 
Dabcyl, digoxigenin, dinitrophenyl, Edans, 6-FAM, fluorescein, 3'-glyceryl, HEX, IRD-700, 
IRD-800, JOE, phosphate, psoralen, rhodamine, ROX, thiol (SH), spacers, TAMRA, 
TET, AMCA-S®, SE, BODIPY®, Marina Blue®, Pacific Blue®, Oregon Green®, Rhodamine 
Green®, Rhodamine Red®, Rhodol Green® and Texas Red®. Polynucleotide backbone 
modifications include methylphosphonate, 2'-OMe-methylphosphonate RNA, 
phosphorothioate, RNA, 2'-OMe RNA. Base modifications include 2-amino-dA, 
2-aminopurine, 3'-(ddA), 3'-dA (cordycepin), 7-deaza-dA, 8-Br-dA, 8-oxo-dA, N6-Me-dA, 
abasic site (dSpacer), biotin dT, 2'-OMe-5Me-C, 2'-OMe-propynyl-C, 3'-(5-Me-dC), 
3'-(ddC), 5-Br-dC, 5-I-dC, 5-Me-dC, 5-F-dC, carboxy-dT, convertible dA, convertible dC, 
convertible dG, convertible dT, convertible dU, 7-deaza-dG, 8-Br-dG, 8-oxo-dG, O6-Me-dG, 
S6-DNP-dG, 4-methyl-indole, 5-nitroindole, 2'-OMe-inosine, 2'-dI, O6-phenyl-dI, 
4-methyl-indole, 2'-deoxynebularine, 5-nitroindole, 2-aminopurine, dP (purine analogue), 
dK (pyrimidine analogue), 3-nitropyrrole, 2-thio-dT, 4-thio-dT, biotin-dT, carboxy-dT, 
O4-Me-dT, O4-triazol dT, 2'-OMe-propynyl-U, 5-Br-dU, 2'-dU, 5-F-dU, 5-I-dU, O4-triazol dU. 

The antisense nucleic acid molecules of the invention are typically introduced into 
a plant or administered to a subject or generated in situ such that they hybridize with or 
bind to cellular mRNA and/or genomic DNA encoding a CCP protein to thereby inhibit 
expression of the protein, e.g., by inhibiting transcription and/or translation. The 
hybridization can be by conventional nucleotide complementarity to form a stable duplex, 
or, for example, in the case of an antisense nucleic acid molecule which binds to DNA 
duplexes, through specific interactions in the major groove of the double helix. An 
example of a route of introduction or administration of antisense nucleic acid molecules of 
the invention includes transformation in a plant or direct injection at a tissue site in a 
subject. Alternatively, antisense nucleic acid molecules can be modified to target selected 
cells and then administered systemically. For example, for systemic administration, 
antisense molecules can be modified such that they specifically bind to receptors or 
antigens expressed on a selected cell surface, e.g., by linking the antisense nucleic acid 
molecules to peptides or antibodies which bind to cell surface receptors or antigens. The 
antisense nucleic acid molecules can also be delivered to cells using the vectors described 
herein. To achieve sufficient intracellular concentrations of the antisense molecules, 
vector constructs in which the antisense nucleic acid molecule is placed under the control 
of a constitutive promoter or a strong pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is 
an α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, 
the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res. 
15:6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide 
(Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue 
(Inoue et al. (1987) FEBS Lett. 215:327-330). 

In another embodiment, the antisense nucleic acid molecule further comprises a 
sense nucleic acid molecule complementary to the antisense nucleic acid molecule. Gene 
silencing methods based on such nucleic acid molecules are well known to the skilled 
artisan (e.g., Grierson et al. (1998) WO 98/53083; Waterhouse et al. (1999) WO 
99/53050). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in 
Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 
CCP mRNA transcripts to thereby inhibit translation of CCP mRNA. A ribozyme having 
specificity for a CCP-encoding nucleic acid can be designed based upon the nucleotide 
sequence of a CCP cDNA disclosed herein (i.e., SEQ ID NO:1-66 or 228-239). For 
example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the 
nucleotide sequence of the active site is complementary to the nucleotide sequence to be 
cleaved in a CCP-encoding mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071; and 
Cech et al. U.S. Patent No. 5,116,742. Alternatively, CCP mRNA can be used to select a 
catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, 
e.g., Bartel, D. and Szostak, J.W. (1993) Science 261:1411-1418. 

The use of ribozymes for gene silencing in plants is known in the art (e.g., Atkins et 
al. (1994) WO 94/00012; Lenne et al. (1995) WO 95/03404; Lutziger et al. (2000) WO 
00/00619; Prinsen et al. (1997) WO 97/13865 and Scott et al. (1997) WO 97/38116). 

Alternatively, CCP gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the CCP (e.g., the CCP promoter 
and/or enhancers) to form triple helical structures that prevent transcription of the CCP 
gene in target cells. See generally, Helene, C. (1991) Anticancer Drug Des. 6(6):569-84; 
Helene, C. et al. (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher, L.J. (1992) Bioassays 
14(12):807-15. 

In yet another embodiment, the CCP nucleic acid molecules of the present 
invention can be modified at the base moiety, sugar moiety or phosphate backbone to 
improve, e.g., the stability, hybridization, or solubility of the molecule. For example, the 
deoxyribose phosphate backbone of the nucleic acid molecules can be modified to generate 
peptide nucleic acids (see Hyrup B. et al. (1996) Bioorganic & Medicinal Chemistry 4 (1): 
5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength. The synthesis of PNA oligomers can be performed 
using standard solid phase peptide synthesis protocols as described in Hyrup B. et al. 
(1996) supra; Perry-O'Keefe et al. Proc. Natl. Acad. Sci. 93: 14670-675. 

PNAs of CCP nucleic acid molecules can be used for increasing crop yield in 
plants or in therapeutic and diagnostic applications. For example, PNAs can be used as 
antisense or antigene agents for sequence-specific modulation of gene expression by, for 
example, inducing transcription or translation arrest or inhibiting replication. PNAs of 
CCP nucleic acid molecules can also be used in the analysis of single base pair mutations 
in a gene, (e.g., by PNA-directed PCR clamping); as 'artificial restriction enzymes' when 
used in combination with other enzymes, (e.g., S1 nucleases (Hyrup B. (1996) supra)); or 
as probes or primers for DNA sequencing or hybridization (Hyrup B. et al. (1996) supra; 
Perry-O'Keefe supra). 

In another embodiment, PNAs of CCP can be modified, (e.g., to enhance their 
stability or cellular uptake), by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras of CCP nucleic acid 
molecules can be generated which may combine the advantageous properties of PNA and 
DNA. Such chimeras allow DNA recognition enzymes, (e.g., RNAse H and DNA 
polymerases), to interact with the DNA portion while the PNA portion would provide high 
binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of 
appropriate lengths selected in terms of base stacking, number of bonds between the 
nucleobases, and orientation (Hyrup B. (1996) supra). The synthesis of PNA-DNA 
chimeras can be performed as described in Hyrup B. (1996) supra and Finn P.J. et al. 
(1996) Nucleic Acids Res. 24 (17): 3357-63. For example, a DNA chain can be 
synthesized on a solid support using standard phosphoramidite coupling chemistry and 
modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine 
phosphoramidite, can be used as a linker between the PNA and the 5' end of DNA (Mag, M. et al. 
(1989) Nucleic Acid Res. 17: 5973-88). PNA monomers are then coupled in a stepwise 
manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA segment 
(Finn P.J. et al. (1996) supra). Alternatively, chimeric molecules can be synthesized with 
a 5' DNA segment and a 3' PNA segment (Peterser, K.H. et al. (1975) Bioorganic Med. 
Chem. Lett. 5: 1119-11124). 

In other embodiments, the oligonucleotide may include other appended groups such 
as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. USA 
86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; PCT 
Publication No. WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. 
WO89/10134). In addition, oligonucleotides can be modified with hybridization-triggered 
cleavage agents (See, e.g., Krol et al. (1988) Bio-Techniques 6:958-976) or intercalating 
agents. (See, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide 
may be conjugated to another molecule, (e.g., a peptide, hybridization triggered cross- 
linking agent, transport agent, or hybridization-triggered cleavage agent). 

II. Isolated CCP Proteins and Anti-CCP Antibodies 

One aspect of the invention pertains to isolated CCP proteins (e.g., the amino acid 
sequences set forth in SEQ ID NO:67-132, 205, 211, 215-216, or 220-227) and 
biologically active portions thereof, as well as polypeptide fragments suitable for use as 
immunogens to raise anti-CCP antibodies. In one embodiment, native CCP proteins can 
be isolated from cells or tissue sources by an appropriate purification scheme using 
standard protein purification techniques. In another embodiment, CCP proteins are 
produced by recombinant DNA techniques. Alternative to recombinant expression, a CCP 
protein or polypeptide can be synthesized chemically using standard peptide synthesis 
techniques. 

An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or tissue 
source from which the CCP protein is derived, or substantially free from chemical 
precursors or other chemicals when chemically synthesized. The language "substantially 
free of cellular material" includes preparations of CCP protein in which the protein is 
separated from cellular components of the cells from which it is isolated or recombinantly 
produced. In one embodiment, the language "substantially free of cellular material" 
includes preparations of CCP protein having less than about 30% (by dry weight) of non- 
CCP protein (also referred to herein as a "contaminating protein"), more preferably less 
than about 20% of non-CCP protein, still more preferably less than about 10% of non-CCP 
protein, and most preferably less than about 5% non-CCP protein. When the CCP protein 
or biologically active portion thereof is recombinantly produced, it is also preferably 
substantially free of culture medium, i.e., culture medium represents less than about 20%, 
more preferably less than about 10%, and most preferably less than about 5% of the 
volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of CCP protein in which the protein is separated from chemical 
precursors or other chemicals which are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other chemicals" 
includes preparations of CCP protein having less than about 30% (by dry weight) of 
chemical precursors or non-CCP chemicals, more preferably less than about 20% chemical 
precursors or non-CCP chemicals, still more preferably less than about 10% chemical 
precursors or non-CCP chemicals, and most preferably less than about 5% chemical 
precursors or non-CCP chemicals. 

Biologically active portions of a CCP protein include peptides comprising amino 
acid sequences sufficiently homologous to or derived from the amino acid sequence of the 
CCP protein, which include fewer amino acids than the full length CCP proteins, and exhibit 
at least one activity of a CCP protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the CCP protein. A biologically active 
portion of a CCP protein can be a polypeptide which is, for example, at least 10, 25, 50, 
100 or more amino acids in length. 

To determine the percent identity of two amino acid sequences or of two nucleic 
acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can 
be introduced in one or both of a first and a second amino acid or nucleic acid sequence for 
optimal alignment and non-homologous sequences can be disregarded for comparison 
purposes). In a preferred embodiment, the length of a reference sequence aligned for 
comparison purposes is at least 30%, preferably at least 40%, more preferably at least 50%, 
even more preferably at least 60%, and even more preferably at least 70%, 80%, or 90% of 
the length of the reference sequence. The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide as 
the corresponding position in the second sequence, then the molecules are identical at that 
position (as used herein amino acid or nucleic acid "identity" is equivalent to amino acid or 
nucleic acid "homology"). The percent identity between the two sequences is a function 
of the number of identical positions shared by the sequences, taking into account the 
number of gaps, and the length of each gap, which need to be introduced for optimal 
alignment of the two sequences. 
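
The position-by-position comparison described above can be sketched as follows. This is a minimal illustration under one stated assumption: percent identity is taken as the number of identical positions divided by the number of aligned columns, gap columns included, which is only one of several conventions in use. The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two already-aligned sequences of equal length.

    Assumed convention: identical columns / all aligned columns, where a
    column holding a gap in one sequence counts in the denominator only.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Drop columns that are gaps in both sequences; they carry no information.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Hypothetical alignment: 5 identical positions over 7 columns.
print(round(percent_identity("MKT-LLV", "MKTALIV"), 1))  # prints 71.4
```

Because each gap adds a column without adding a match, introducing more or longer gaps lowers the value, which reflects the statement above that the number and length of gaps are taken into account.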

The comparison of sequences and determination of percent identity between two 
sequences can be accomplished using a mathematical algorithm. In a preferred 
embodiment, the percent identity between two amino acid sequences is determined using 
the Needleman and Wunsch (J. Mol. Biol. (48):444-453 (1970)) algorithm which has been 
incorporated into the GAP program in the GCG software package (available at 
http://www.gcg.com), using either a Blosum 62 matrix or a PAM250 matrix, and a gap 
weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another 
preferred embodiment, the percent identity between two nucleotide sequences is 
determined using the GAP program in the GCG software package (available at 
http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 
70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. A preferred, non-limiting example of 
parameters to be used in conjunction with the GAP program includes a Blosum 62 scoring 
matrix with a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 
5. 
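
For orientation, the global alignment principle of Needleman and Wunsch can be sketched in a few lines. The sketch below is a simplification and not the GAP program: it assumes a fixed match/mismatch score in place of a Blosum 62 or PAM250 matrix and a single linear gap penalty in place of separate gap and gap extend penalties. All names and parameter values are illustrative assumptions.

```python
def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -2):
    """Global alignment with a linear gap penalty.

    Returns (score, aligned_a, aligned_b).
    """
    n, m = len(a), len(b)

    def sub(i: int, j: int) -> int:
        # Substitution score for a[i-1] against b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    # Fill the dynamic programming matrix.
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(score[i - 1][j - 1] + sub(i, j),
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


best, top, bottom = needleman_wunsch("GATTACA", "GATACA")
print(best)
print(top)
print(bottom)
```

Several alignments can share the optimal score; the traceback shown returns only one of them.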

In another embodiment, the percent identity between two amino acid or nucleotide 
sequences is determined using the algorithm of E. Meyers and W. Miller (Comput. Appl. 
Biosci., 4:11-17 (1988)) which has been incorporated into the ALIGN program (version 
2.0 or version 2.0U), using a PAM120 weight residue table, a gap length penalty of 12 and 
a gap penalty of 4. 

The nucleic acid and polypeptide sequences of the present invention can further be 
used as a "query sequence" to perform a search against public databases to, for example, 
identify other family members or related sequences. Such searches can be performed using 
the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 
215:403-10. BLAST nucleotide searches can be performed with the NBLAST program, 
score = 100, wordlength = 12 to obtain nucleotide sequences homologous to Kinase and 
Phosphatase nucleic acid molecules of the invention. BLAST protein searches can be 
performed with the XBLAST program, score = 100, wordlength = 3, and a Blosum62 
matrix to obtain amino acid sequences homologous to Kinase and Phosphatase polypeptide 
molecules of the invention. To obtain gapped alignments for comparison purposes, 
Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Res. 
25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the default 
parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See 
http://www.ncbi.nlm.nih.gov. 

The invention also provides CCP chimeric or fusion proteins. As used herein, a 
CCP "chimeric protein" or "fusion protein" comprises a CCP polypeptide operatively 
linked to a non-CCP polypeptide. A "CCP polypeptide" refers to a polypeptide having an 
amino acid sequence corresponding to CCP, whereas a "non-CCP polypeptide" refers to a 
polypeptide having an amino acid sequence corresponding to a protein which is not 
substantially homologous to the CCP protein, e.g., a protein which is different from the 
CCP protein and which is derived from the same or a different organism. Within a CCP 
fusion protein the CCP polypeptide can correspond to all or a portion of a CCP protein. In 
a preferred embodiment, a CCP fusion protein comprises at least one biologically active 
portion of a CCP protein. In another preferred embodiment, a CCP fusion protein 
comprises at least two biologically active portions of a CCP protein. Within the fusion 
protein, the term "operatively linked" is intended to indicate that the CCP polypeptide and 
the non-CCP polypeptide are fused in-frame to each other. The non-CCP polypeptide can 
be fused to the N-terminus or C-terminus of the CCP polypeptide or can be inserted within 
the CCP polypeptide. The non-CCP polypeptide can, for example, be (histidine)6-tag, 
glutathione S-transferase, protein A, maltose-binding protein, dihydrofolate reductase, 
Tag*100 epitope (EETARFQPGYRS; SEQ ID NO:199), c-myc epitope (EQKLISEEDL; 
SEQ ID NO:200), FLAG®-epitope (DYKDDDK; SEQ ID NO:201), lacZ, CMP 
(calmodulin-binding peptide), HA epitope (YPYDVPDYA; SEQ ID NO:202), protein C 
epitope (EDQVDPRLIDGK; SEQ ID NO:203) or VSV epitope (YTDIEMNRLGK; SEQ 
ID NO:204). 

For example, in one embodiment, the fusion protein is a GST-CCP fusion protein 
in which the CCP sequences are fused to the C-terminus of the GST sequences. Such 
fusion proteins can facilitate the purification of recombinant CCP. 

In another embodiment, the fusion protein is a CCP protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells (e.g., plant or 
mammalian host cells), expression and/or secretion of CCP can be increased through use 
of a heterologous signal sequence. 

The CCP fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a plant or a subject in vivo. The CCP fusion proteins can 
be used to affect the bioavailability of a CCP substrate. Use of CCP fusion proteins may 
be useful agriculturally for the increase of crop yields or therapeutically for the treatment 
of cellular growth related disorders, e.g., cancer. Moreover, the CCP-fusion proteins of the 
invention can be used as immunogens to produce anti-CCP antibodies in a subject, to 
purify CCP ligands and in screening assays to identify molecules which inhibit the 
interaction of CCP with a CCP substrate, e.g., a kinase such as CDC2b. 

Preferably, a CCP chimeric or fusion protein of the invention is produced by 
standard recombinant DNA techniques. For example, DNA fragments coding for the 
different polypeptide sequences are ligated together in-frame in accordance with 
conventional techniques, for example by employing blunt-ended or stagger-ended termini 
for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of 
cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, 
and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers which give rise to 
complementary overhangs between two consecutive gene fragments which can 
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for 
example, Current Protocols in Molecular Biology, eds. Ausubel et al. John Wiley & Sons: 
1992). Moreover, many expression vectors are commercially available that already encode 
a fusion moiety (e.g., a GST polypeptide). A CCP-encoding nucleic acid can be cloned 
into such an expression vector such that the fusion moiety is linked in-frame to the CCP 
protein. 

The present invention also pertains to variants of the CCP proteins which function 
as either CCP agonists (mimetics) or as CCP antagonists. Variants of the CCP proteins 
can be generated by mutagenesis, e.g., discrete point mutation or truncation of a CCP 
protein. An agonist of the CCP proteins can retain substantially the same, or a subset, of 
the biological activities of the naturally occurring form of a CCP protein. An antagonist of 
a CCP protein can inhibit one or more of the activities of the naturally occurring form of 
the CCP protein by, for example, competitively modulating a cellular activity of a CCP 
protein. Thus, specific biological effects can be elicited by treatment with a variant of 
limited function. In one embodiment, treatment of a subject with a variant having a subset 
of the biological activities of the naturally occurring form of the protein has fewer side 
effects in a subject relative to treatment with the naturally occurring form of the CCP 
protein. 

In one embodiment, variants of a CCP protein which function as either CCP 
agonists (mimetics) or as CCP antagonists can be identified by screening combinatorial 
libraries of mutants, e.g., truncation mutants, of a CCP protein for CCP protein agonist or 
antagonist activity. In one embodiment, a variegated library of CCP variants is generated 
by combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene 
library. A variegated library of CCP variants can be produced by, for example, 
enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences such 
that a degenerate set of potential CCP sequences is expressible as individual polypeptides, 
or alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the set 
of CCP sequences therein. There are a variety of methods which can be used to produce 
libraries of potential CCP variants from a degenerate oligonucleotide sequence. Chemical 
synthesis of a degenerate gene sequence can be performed in an automatic DNA 
synthesizer, and the synthetic gene then ligated into an appropriate expression vector. Use 
of a degenerate set of genes allows for the provision, in one mixture, of all of the 
sequences encoding the desired set of potential CCP sequences. Methods for synthesizing 
degenerate oligonucleotides are known in the art (see, e.g., Narang, S.A. (1983) 
Tetrahedron 39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) 
Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477). 
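
To illustrate how a single degenerate oligonucleotide stands for a whole set of sequences, the sketch below expands a sequence written with the standard IUPAC ambiguity codes into every concrete sequence it encodes. The function name and the example oligonucleotides are illustrative assumptions and are not sequences of the invention.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(oligo: str) -> list:
    """Return every concrete DNA sequence encoded by a degenerate oligonucleotide."""
    choices = [IUPAC[base] for base in oligo.upper()]
    return ["".join(combo) for combo in product(*choices)]


# GGN covers the four glycine codons.
print(expand_degenerate("GGN"))        # prints ['GGA', 'GGC', 'GGG', 'GGT']
print(len(expand_degenerate("GARGGN")))  # prints 8
```

The size of the set is the product of the number of alternatives at each position, so it grows quickly with the number of degenerate positions.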

In addition, libraries of fragments of a CCP protein coding sequence can be used to 
generate a variegated population of CCP fragments for screening and subsequent selection 
of variants of a CCP protein. In one embodiment, a library of coding sequence fragments 
can be generated by treating a double stranded PCR fragment of a CCP coding sequence 
with a nuclease under conditions wherein nicking occurs only about once per molecule, 
denaturing the double stranded DNA, renaturing the DNA to form double stranded DNA 
which can include sense/antisense pairs from different nicked products, removing single 
stranded portions from reformed duplexes by treatment with S1 nuclease, and ligating the 
resulting fragment library into an expression vector. By this method, an expression library 
can be derived which encodes N-terminal, C-terminal and internal fragments of various 
sizes of the CCP protein. 
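
The three classes of fragments named above can be made concrete with a small in silico enumeration. This sketch only lists the possible contiguous fragments of a protein sequence and sorts them by position; it does not model the nuclease treatment itself. The function name, the minimum length and the toy sequence are hypothetical.

```python
def classify_fragments(protein: str, min_len: int = 3) -> dict:
    """Enumerate contiguous fragments of a protein and sort them by position.

    The full-length sequence itself is not reported as a fragment.
    """
    n = len(protein)
    result = {"N-terminal": [], "C-terminal": [], "internal": []}
    for start in range(n):
        for end in range(start + min_len, n + 1):
            if start == 0 and end == n:
                continue                      # skip the full-length protein
            fragment = protein[start:end]
            if start == 0:
                result["N-terminal"].append(fragment)
            elif end == n:
                result["C-terminal"].append(fragment)
            else:
                result["internal"].append(fragment)
    return result


# Hypothetical five-residue sequence.
for kind, fragments in classify_fragments("MKTAY").items():
    print(kind, fragments)
```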

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of CCP 
proteins. The most widely used techniques, which are amenable to high through-put 
analysis, for screening large gene libraries typically include cloning the gene library into 
replicable expression vectors, transforming appropriate cells with the resulting library of 
vectors, and expressing the combinatorial genes under conditions in which detection of a 
desired activity facilitates isolation of the vector encoding the gene whose product was 
detected. Recursive ensemble mutagenesis (REM), a new technique which enhances the 
frequency of functional mutants in the libraries, can be used in combination with the 
screening assays to identify CCP variants (Arkin and Yourvan (1992) Proc. Natl. Acad. 
Sci. USA 89:7811-7815; Delgrave et al. (1993) Protein Engineering 6(3):327-331). 

In one embodiment, cell based assays can be exploited to analyze a variegated CCP 
library. For example, a library of expression vectors can be transfected into a cell line 
which ordinarily synthesizes and secretes CCP. The transfected cells are then cultured 
such that CCP and a particular mutant CCP are secreted and the effect of expression of the 
mutant on CCP activity in cell supernatants can be detected, e.g., by any of a number of 
enzymatic assays. Plasmid DNA can then be recovered from the cells which score for 
inhibition, or alternatively, potentiation of CCP activity, and the individual clones further 
characterized. 

An isolated CCP protein, or a portion or fragment thereof, can be used as an 
immunogen to generate antibodies that bind CCP using standard techniques for polyclonal 
and monoclonal antibody preparation. A full-length CCP protein can be used or, 
alternatively, the invention provides antigenic peptide fragments of CCP for use as 
immunogens. The antigenic peptide of CCP comprises at least 8 amino acid residues and 

35 encompasses an epitope of CCP such that an antibody raised against the peptide forms a 
specific immune complex with CCP. Preferably, the antigenic peptide comprises at least 
10 amino acid residues, more preferably at least 15 amino acid residues, even more 
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preferably at least 20 amino acid residues, and most preferably at least 30 amino acid 
residues. 

Preferred epitopes encompassed by the antigenic peptide are regions of CCP that 
are located on the surface of the protein, e.g., hydrophilic regions. 
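
The preference for hydrophilic regions can be illustrated with a simple sliding-window
scan. The sketch below is only an assumption about how such regions might be located:
the text names no particular method, and the Kyte-Doolittle hydropathy scale, the
function name most_hydrophilic_window and the toy sequence are introduced here for
illustration. The window of 8 residues mirrors the minimum peptide length stated above.
Low hydropathy is at best a rough proxy for surface exposure.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic, negative = hydrophilic).
# Using this scale is an assumption; the text does not prescribe one.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def most_hydrophilic_window(protein, window=8):
    """Return (start, peptide, mean_hydropathy) for the most hydrophilic window."""
    if len(protein) < window:
        raise ValueError("protein is shorter than the window")
    best = None
    for start in range(len(protein) - window + 1):
        peptide = protein[start:start + window]
        score = sum(KYTE_DOOLITTLE[aa] for aa in peptide) / window
        # Lower mean hydropathy means a more hydrophilic stretch.
        if best is None or score < best[2]:
            best = (start, peptide, score)
    return best


# Hypothetical toy sequence for illustration only; it is not a CCP sequence.
toy = "MLLIVAVGKRDEEKRDSGLLVVAI"
start, peptide, score = most_hydrophilic_window(toy, window=8)
print(start, peptide, round(score, 2))
```

A candidate peptide found this way would still need to be checked experimentally for
its ability to raise antibodies that form a specific immune complex with CCP.
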

A CCP immunogen typically is used to prepare antibodies by immunizing a
suitable subject (e.g., rabbit, goat, mouse or other mammal) with the immunogen. An
appropriate immunogenic preparation can contain, for example, recombinantly expressed
CCP protein or a chemically synthesized CCP polypeptide. The preparation can further
include an adjuvant, such as Freund's complete or incomplete adjuvant, or similar
immunostimulatory agent. Immunization of a suitable subject with an immunogenic CCP
preparation induces a polyclonal anti-CCP antibody response.

Accordingly, another aspect of the invention pertains to anti-CCP antibodies. The
term "antibody" as used herein refers to immunoglobulin molecules and immunologically
active portions of immunoglobulin molecules, i.e., molecules that contain an antigen
binding site which specifically binds (immunoreacts with) an antigen, such as CCP.
Examples of immunologically active portions of immunoglobulin molecules include F(ab)
and F(ab')2 fragments which can be generated by treating the antibody with an enzyme
such as pepsin. The invention provides polyclonal and monoclonal antibodies that bind
CCP. The term "monoclonal antibody" or "monoclonal antibody composition", as used
herein, refers to a population of antibody molecules that contain only one species of an
antigen binding site capable of immunoreacting with a particular epitope of CCP. A
monoclonal antibody composition thus typically displays a single binding affinity for a
particular CCP protein with which it immunoreacts.

Polyclonal anti-CCP antibodies can be prepared as described above by immunizing
a suitable subject with a CCP immunogen. The anti-CCP antibody titer in the immunized
subject can be monitored over time by standard techniques, such as with an enzyme linked
immunosorbent assay (ELISA) using immobilized CCP. If desired, the antibody
molecules directed against CCP can be isolated from the mammal (e.g., from the blood)
and further purified by well known techniques, such as protein A chromatography to
obtain the IgG fraction. At an appropriate time after immunization, e.g., when the
anti-CCP antibody titers are highest, antibody-producing cells can be obtained from the subject
and used to prepare monoclonal antibodies by standard techniques, such as the hybridoma
technique originally described by Kohler and Milstein ((1975) Nature 256:495-497; see
also Brown et al. (1981) J. Immunol. 127:539-46; Brown et al. (1980) J. Biol. Chem.
255:4980-83; Yeh et al. (1976) Proc. Natl. Acad. Sci. USA 76:2927-31; and Yeh et al.
(1982) Int. J. Cancer 29:269-75), the more recent human B cell hybridoma technique
(Kozbor et al. (1983) Immunol. Today 4:72), the EBV-hybridoma technique (Cole et al.
(1985), Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or
trioma techniques. The technology for producing monoclonal antibody hybridomas is well
known (see generally R. H. Kenneth, in Monoclonal Antibodies: A New Dimension In
Biological Analyses, Plenum Publishing Corp., New York, New York (1980); E. A. Lerner
(1981) Yale J. Biol. Med. 54:387-402; M. L. Gefter et al. (1977) Somatic Cell Genet.
3:231-36). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes
(typically splenocytes) from a mammal immunized with a CCP immunogen as described
above, and the culture supernatants of the resulting hybridoma cells are screened to
identify a hybridoma producing a monoclonal antibody that binds CCP.

Any of the many well known protocols used for fusing lymphocytes and
immortalized cell lines can be applied for the purpose of generating an anti-CCP
monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature 266:550-52; Gefter et al.
Somatic Cell Genet., cited supra; Lerner, Yale J. Biol. Med., cited supra; Kenneth,
Monoclonal Antibodies, cited supra). Moreover, the ordinarily skilled worker will
appreciate that there are many variations of such methods which also would be useful.
Typically, the immortal cell line (e.g., a myeloma cell line) is derived from the same
mammalian species as the lymphocytes. For example, murine hybridomas can be made by
fusing lymphocytes from a mouse immunized with an immunogenic preparation of the
present invention with an immortalized mouse cell line. Preferred immortal cell lines are
mouse myeloma cell lines that are sensitive to culture medium containing hypoxanthine,
aminopterin and thymidine ("HAT medium"). Any of a number of myeloma cell lines can
be used as a fusion partner according to standard techniques, e.g., the P3-NS1/1-Ag4-1,
P3-x63-Ag8.653 or Sp2/O-Ag14 myeloma lines. These myeloma lines are available from
ATCC. Typically, HAT-sensitive mouse myeloma cells are fused to mouse splenocytes
using polyethylene glycol ("PEG"). Hybridoma cells resulting from the fusion are then
selected using HAT medium, which kills unfused and unproductively fused myeloma cells
(unfused splenocytes die after several days because they are not transformed). Hybridoma
cells producing a monoclonal antibody of the invention are detected by screening the
hybridoma culture supernatants for antibodies that bind CCP, e.g., using a standard ELISA
assay.

Alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal
anti-CCP antibody can be identified and isolated by screening a recombinant combinatorial
immunoglobulin library (e.g., an antibody phage display library) with CCP to thereby
isolate immunoglobulin library members that bind CCP. Kits for generating and screening
phage display libraries are commercially available (e.g., the Pharmacia Recombinant
Phage Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage
Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents
particularly amenable for use in generating and screening an antibody display library can be
found in, for example, Ladner et al. U.S. Patent No. 5,223,409; Kang et al. PCT
International Publication No. WO 92/18619; Dower et al. PCT International Publication
No. WO 91/17271; Winter et al. PCT International Publication WO 92/20791; Markland et
al. PCT International Publication No. WO 92/15679; Breitling et al. PCT International
Publication WO 93/01288; McCafferty et al. PCT International Publication No. WO
92/01047; Garrard et al. PCT International Publication No. WO 92/09690; Ladner et al.
PCT International Publication No. WO 90/02809; Fuchs et al. (1991) Bio/Technology
9:1370-1372; Hay et al. (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989)
Science 246:1275-1281; Griffiths et al. (1993) EMBO J. 12:725-734; Hawkins et al. (1992)
J. Mol. Biol. 226:889-896; Clarkson et al. (1991) Nature 352:624-628; Gram et al. (1992)
Proc. Natl. Acad. Sci. USA 89:3576-3580; Garrard et al. (1991) Bio/Technology
9:1373-1377; Hoogenboom et al. (1991) Nuc. Acid Res. 19:4133-4137; Barbas et al. (1991) Proc.
Natl. Acad. Sci. USA 88:7978-7982; and McCafferty et al. (1990) Nature 348:552-554.

Additionally, recombinant anti-CCP antibodies, such as chimeric and humanized
monoclonal antibodies, comprising both human and non-human portions, which can be
made using standard recombinant DNA techniques, are within the scope of the invention.
Such chimeric and humanized monoclonal antibodies can be produced by recombinant
DNA techniques known in the art, for example using methods described in Robinson et al.
International Application No. PCT/US86/02269; Akira, et al. European Patent Application
184,187; Taniguchi, M., European Patent Application 171,496; Morrison et al. European
Patent Application 173,494; Neuberger et al. PCT International Publication No. WO
86/01533; Cabilly et al. U.S. Patent No. 4,816,567; Cabilly et al. European Patent
Application 125,023; Better et al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc.
Natl. Acad. Sci. USA 84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et
al. (1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura et al. (1987) Canc. Res.
47:999-1005; Wood et al. (1985) Nature 314:446-449; Shaw et al. (1988) J. Natl.
Cancer Inst. 80:1553-1559; Morrison, S. L. (1985) Science 229:1202-1207; Oi et al.
(1986) BioTechniques 4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature
321:522-525; Verhoeyen et al. (1988) Science 239:1534; and Beidler et al. (1988) J.
Immunol. 141:4053-4060.

An anti-CCP antibody (e.g., monoclonal antibody) can be used to isolate CCP by
standard techniques, such as affinity chromatography or immunoprecipitation. An
anti-CCP antibody can facilitate the purification of natural CCP from cells and of
recombinantly produced CCP expressed in host cells. Moreover, an anti-CCP antibody
can be used to detect CCP protein (e.g., in a cellular lysate or cell supernatant) in order to
evaluate the abundance and pattern of expression of the CCP protein. These antibodies can
also be used, for example, for the immunoprecipitation and immunolocalization of proteins
according to the invention as well as for the monitoring of the synthesis of such proteins,
for example, in recombinant organisms, and for the identification of compounds interacting
with the protein according to the invention.

Anti-CCP antibodies can be used diagnostically to monitor protein levels in tissue
as part of a clinical testing procedure, e.g., to determine the efficacy of a
given treatment regimen. Detection can be facilitated by coupling (i.e., physically linking)
the antibody to a detectable substance. Examples of detectable substances include various
enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent
materials, and radioactive materials. Examples of suitable enzymes include horseradish
peroxidase, alkaline phosphatase, β-galactosidase, or acetylcholinesterase; examples of
suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples
of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or
phycoerythrin; an example of a luminescent material includes luminol; examples of
bioluminescent materials include luciferase, luciferin, and aequorin; and examples of
suitable radioactive materials include 125I, 131I, 35S or 3H.

III. Computer Readable Means 

The CCP nucleotide sequences of the invention (e.g., SEQ ID NO:1-66 or 228-239)
or amino acid sequences of the invention (e.g., SEQ ID NO:67-132, 205, 211, 215-216, or
220-227) are also provided in a variety of mediums to facilitate use thereof. As used
herein, "provided" refers to a manufacture, other than an isolated nucleic acid or amino
acid molecule, which contains a nucleotide or amino acid sequence of the present
invention. Such a manufacture provides the nucleotide or amino acid sequences, or a
subset thereof (e.g., a subset of open reading frames (ORFs)) in a form which allows a
skilled artisan to examine the manufacture using means not directly applicable to
examining the nucleotide or amino acid sequences, or a subset thereof, as they exist in
nature or in purified form.

In one application of this embodiment, a nucleotide or amino acid sequence of the
present invention can be recorded on computer readable media. As used herein, "computer
readable media" includes any medium that can be read and accessed directly by a
computer. Such media include, but are not limited to: magnetic storage media, such as
floppy discs, hard disc storage medium, and magnetic tape; optical storage media such as
CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these
categories such as magnetic/optical storage media. The skilled artisan will readily
appreciate how any of the presently known computer readable mediums can be used to
create a manufacture comprising computer readable medium having recorded thereon a
nucleotide or amino acid sequence of the present invention.

As used herein, "recorded" refers to a process of storing information on computer
readable medium. The skilled artisan can readily adopt any of the presently known
methods for recording information on a computer readable medium to generate
manufactures comprising the nucleotide or amino acid sequence information of the present
invention.

A variety of data storage structures are available to a skilled artisan for creating a
computer readable medium having recorded thereon a nucleotide or amino acid sequence
of the present invention. The choice of the data storage structure will generally be based
on the means chosen to access the stored information. In addition, a variety of data
processor programs and formats can be used to store the nucleotide sequence information
of the present invention on computer readable medium. The sequence information can be
represented in a word processing text file, formatted in commercially-available software
such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file,
stored in a database application, such as DB2, Sybase, Oracle, or the like. The skilled
artisan can readily adapt any number of data processor structuring formats (e.g., text file or
database) in order to obtain computer readable medium having recorded thereon the
nucleotide sequence information of the present invention.
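
The two storage routes named above, an ASCII text file and a database application, can
be sketched as follows. This is a minimal illustration under stated assumptions: the
FASTA layout, the use of SQLite in place of DB2, Sybase or Oracle, the table and column
names, and the placeholder records are all choices made here and are not specified in
the text.

```python
import os
import sqlite3
import tempfile


def write_fasta(path, records):
    """Write {identifier: sequence} as a plain ASCII FASTA file, 60 residues per line."""
    with open(path, "w", encoding="ascii") as handle:
        for identifier, sequence in records.items():
            handle.write(f">{identifier}\n")
            for i in range(0, len(sequence), 60):
                handle.write(sequence[i:i + 60] + "\n")


def store_in_database(connection, records):
    """Store the same records in a relational table (hypothetical schema)."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS sequences "
        "(id TEXT PRIMARY KEY, residues TEXT NOT NULL)"
    )
    connection.executemany(
        "INSERT OR REPLACE INTO sequences (id, residues) VALUES (?, ?)",
        list(records.items()),
    )
    connection.commit()


# Hypothetical placeholder records; real entries would be the SEQ ID NO sequences.
records = {"toy_nucleotide_1": "ATGGCTAAAGGTTGA", "toy_protein_1": "MAKG"}

# Route 1: ASCII text file.
fasta_path = os.path.join(tempfile.mkdtemp(), "sequences.fasta")
write_fasta(fasta_path, records)

# Route 2: database application (in-memory SQLite as a stand-in).
connection = sqlite3.connect(":memory:")
store_in_database(connection, records)
stored = dict(connection.execute("SELECT id, residues FROM sequences"))
```

Which route is preferable depends, as stated above, on the means chosen to access the
stored information: a flat file suits sequential scanning, while a database suits
lookup by identifier.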

By providing the nucleotide or amino acid sequences of the invention in computer
readable form, the skilled artisan can routinely access the sequence information for a
variety of purposes. For example, one skilled in the art can use the nucleotide or amino
acid sequences of the invention in computer readable form to compare a target sequence or
target structural motif with the sequence information stored within the data storage means.
Search means are used to identify fragments or regions of the sequences of the invention
which match a particular target sequence or target motif.
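
A search means of the kind described above can be sketched in a few lines. The example
is a hedged illustration, not the search software contemplated by the text: the
function names, the toy database and the TATA-box-like pattern are assumptions, and
real searches would normally tolerate mismatches rather than require exact matches.

```python
import re


def find_target(sequences, target):
    """Return (identifier, start) for every exact occurrence, overlaps included."""
    hits = []
    for identifier, sequence in sequences.items():
        start = sequence.find(target)
        while start != -1:
            hits.append((identifier, start))
            start = sequence.find(target, start + 1)
    return hits


def find_motif(sequences, pattern):
    """Return (identifier, start, matched_text) for every regex match of pattern."""
    compiled = re.compile(pattern)
    return [
        (identifier, match.start(), match.group())
        for identifier, sequence in sequences.items()
        for match in compiled.finditer(sequence)
    ]


# Hypothetical toy sequences standing in for the stored sequence information.
database = {"toy_1": "GGTATAAAGGCATGCATGC", "toy_2": "CCCTATATATGG"}

exact_hits = find_target(database, "CATGC")        # target sequence search
motif_hits = find_motif(database, r"TATA[AT]A")    # toy motif, one variable position
```

Positions are zero-based offsets into each stored sequence.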

As used herein, a "target sequence" can be any DNA or amino acid sequence of six
or more nucleotides or two or more amino acids. A skilled artisan can readily recognize
that the longer a target sequence is, the less likely a target sequence will be present as a
random occurrence in the database. The most preferred sequence length of a target
sequence is from about 10 to 100 amino acids or from about 30 to 300 nucleotide residues.
However, it is well recognized that commercially important fragments, such as sequence
fragments involved in gene expression and protein processing, may be of shorter length.
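
The relationship between target length and chance matches can be made concrete with a
back-of-the-envelope estimate. The sketch assumes, purely for illustration, that
residues are independent and equally likely (4 nucleotides, 20 amino acids), so a
target of length n matches a given position with probability 4^-n or 20^-n. Real
sequences are not uniformly random, so the numbers indicate orders of magnitude only.
The function name and the database size of one million positions are hypothetical.

```python
def expected_random_hits(target_length, alphabet_size, database_positions):
    """Expected chance matches, assuming independent, equally likely residues."""
    return database_positions * alphabet_size ** (-target_length)


one_million = 1_000_000  # hypothetical number of searchable positions
for label, length, alphabet in [
    ("6 nucleotides", 6, 4),
    ("30 nucleotides", 30, 4),
    ("2 amino acids", 2, 20),
    ("10 amino acids", 10, 20),
]:
    print(label, expected_random_hits(length, alphabet, one_million))
```

Under these assumptions the minimum lengths (six nucleotides, two amino acids) are
expected to match by chance many times in a database of this size, whereas targets at
the lower end of the preferred ranges are expected to match by chance far less than
once.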

As used herein, "a target structural motif," or "target motif," refers to any rationally 
selected sequence or combination of sequences in which the sequence(s) are chosen based 
on a three-dimensional configuration which is formed upon the folding of the target motif.
There are a variety of target motifs known in the art. Protein target motifs include, but are
not limited to, enzyme active sites and signal sequences. Nucleic acid target motifs
include, but are not limited to, promoter sequences, hairpin structures and inducible
expression elements (protein binding sequences).

Computer software is publicly available which allows a skilled artisan to access
sequence information provided in a computer readable medium for analysis and
comparison to other sequences. A variety of known algorithms are disclosed publicly and
a variety of commercially available software for conducting search means is available and
can be used in the computer-based systems of the present invention. Examples of such
software include, but are not limited to, MacPattern (EMBL), BLASTN and BLASTX (NCBI).

For example, software which implements the BLAST (Altschul et al. (1990) J.
Mol. Biol. 215:403-410) and BLAZE (Brutlag et al. (1993) Comp. Chem. 17:203-207)
search algorithms on a Sybase system can be used to identify open reading frames (ORFs)
of the sequences of the invention which contain homology to ORFs or proteins from other
libraries. Such ORFs are protein encoding fragments and are useful in producing
commercially important proteins such as enzymes used in various reactions and in the
production of commercially useful metabolites.
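
Before any homology search, candidate ORFs have to be located in the stored nucleotide
sequences. The sketch below shows one simple way to do this and is not the BLAST or
BLAZE algorithm cited above: it scans only the three forward reading frames for an ATG
start codon followed in frame by a stop codon. The function name, the minimum length
and the toy sequence are assumptions made for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna, min_codons=3):
    """Return (start, end, orf) for ATG...stop ORFs in the three forward frames."""
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame start codon opens an ORF
            elif start is not None and codon in STOP_CODONS:
                # Length counts the start and stop codons.
                if (i + 3 - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, dna[start:i + 3]))
                start = None
    return orfs


# Hypothetical toy sequence; not one of the sequences of the invention.
toy = "CCATGGCTAAAGGTTGACC"
orfs = find_orfs(toy)
```

A fuller implementation would also scan the reverse complement strand and would pass
each ORF, or its translation, to a homology search such as BLASTN or BLASTX.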

IV. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding a CCP protein (or a portion thereof). As used herein, 
the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic 
acid to which it has been linked. One type of vector is a "plasmid", which refers to a 
circular double stranded DNA loop into which additional DNA segments can be ligated. 
Another type of vector is a viral vector, wherein additional DNA segments can be ligated 
into the viral genome. Certain vectors are capable of autonomous replication in a host cell 
into which they are introduced (e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 
mammalian vectors) are integrated into the genome of a host cell upon introduction into 
the host cell, and thereby are replicated along with the host genome. Moreover, certain 
vectors are capable of directing the expression of genes to which they are operatively 
linked. Such vectors are referred to herein as "expression vectors". In general, expression 
vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the 
present specification, "plasmid" and "vector" can be used interchangeably as the plasmid is 
the most commonly used form of vector. However, the invention is intended to include 
such other forms of expression vectors, such as viral vectors (e.g., replication defective
retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions.

The recombinant expression vectors of the invention comprise a nucleic acid of the
invention in a form suitable for expression of the nucleic acid in a host cell, e.g., a plant
cell, which means that the recombinant expression vectors include one or more regulatory
sequences, selected on the basis of the host cells to be used for expression, which are
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant
expression vector, "operably linked" is intended to mean that the nucleotide sequence of
interest is linked to the regulatory sequence(s) in a manner which allows for expression of
the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell
when the vector is introduced into the host cell). The term "regulatory sequence" is
intended to include promoters, enhancers and other expression control elements (e.g.,
polyadenylation signals). Such regulatory sequences are described, for example, in
Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press,
San Diego, CA (1990). Regulatory sequences include those which direct constitutive
expression of a nucleotide sequence in many types of host cell and those which direct
expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific
regulatory sequences). It will be appreciated by those skilled in the art that the design of
the expression vector can depend on such factors as the choice of the host cell to be
transformed, the level of expression of protein desired, and the like. The expression
vectors of the invention can be introduced into host cells to thereby produce proteins or
peptides, including fusion proteins or peptides, encoded by nucleic acids as described
herein (e.g., CCP proteins, mutant forms of CCP proteins, fusion proteins, and the like).

The vectors of the invention comprise a selectable and/or scorable marker.
Selectable marker genes useful for the selection of transformed plant cells, callus, plant
tissue and plants are well known to those skilled in the art and comprise, for example,
antimetabolite resistance as the basis of selection for dhfr, which confers resistance to
methotrexate (Reiss, Plant Physiol. (Life Sci. Adv.) 13 (1994), 143-149); npt, which
confers resistance to the aminoglycosides neomycin, kanamycin and paromomycin
(Herrera-Estrella, EMBO J. 2 (1983), 987-995) and hygro, which confers resistance to hygromycin
(Marsh, Gene 32 (1984), 481-485). Additional selectable genes have been described,
namely trpB, which allows cells to utilize indole in place of tryptophan; hisD, which allows
cells to utilize histidinol in place of histidine (Hartman, Proc. Natl. Acad. Sci. USA 85
(1988), 8047); mannose-6-phosphate isomerase which allows cells to utilize mannose (WO
94/20627) and ODC (ornithine decarboxylase) which confers resistance to the ornithine
decarboxylase inhibitor, 2-(difluoromethyl)-DL-ornithine, DFMO (McConlogue, 1987, In:
Current Communications in Molecular Biology, Cold Spring Harbor Laboratory ed.) or
deaminase from Aspergillus terreus which confers resistance to Blasticidin S (Tamura,
Biosci. Biotechnol. Biochem. 59 (1995), 2336-2338).

Useful scorable markers are also known to those skilled in the art and are
commercially available. Advantageously, the marker is a gene encoding luciferase
(Giacomin, Pl. Sci. 116 (1996), 59-72; Scikantha, J. Bact. 178 (1996), 121), green
fluorescent protein (Gerdes, FEBS Lett. 389 (1996), 44-47) or β-glucuronidase (Jefferson,
EMBO J. 6 (1987), 3901-3907). This embodiment is particularly useful for simple and
rapid screening of cells, tissues and organisms containing a vector of the invention.

A "plant promoter" is a promoter capable of initiating transcription in plant cells.
Exemplary plant promoters include, but are not limited to, those that are obtained from
plants, plant viruses, and bacteria. Preferred promoters may contain additional copies of
one or more specific regulatory elements, to further enhance expression and/or to alter the
spatial expression and/or temporal expression of a nucleic acid molecule to which it is
operably connected. For example, copper-responsive, glucocorticoid-responsive or
dexamethasone-responsive regulatory elements may be placed adjacent to a heterologous
promoter sequence driving expression of a nucleic acid molecule to confer
copper-inducible, glucocorticoid-inducible, or dexamethasone-inducible expression respectively,
on said nucleic acid molecule. Examples of promoters under developmental control
include promoters that preferentially initiate transcription in certain tissues, such as leaves,
roots, seeds, endosperm, embryos, fibers, xylem vessels, tracheids, or sclerenchyma. Such
promoters are referred to as "tissue preferred." Promoters which initiate transcription only
in certain tissue are referred to as "tissue specific." A "cell type" specific promoter
primarily drives expression in certain cell types in one or more organs, for example,
vascular cells in roots or leaves. An "inducible" promoter is a promoter which is under
environmental control. Examples of environmental conditions that may affect
transcription by inducible promoters include anaerobic conditions or the presence of light.
Tissue specific, tissue preferred, cell type specific, and inducible promoters constitute the
class of "non-constitutive" promoters. A "constitutive" promoter is a promoter which is
active under most environmental conditions.

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms 
refer not only to the particular subject cell but to the progeny or potential progeny of such a
cell. Because certain modifications may occur in succeeding generations due to either 
mutation or environmental influences, such progeny may not, in fact, be identical to the 
parent cell, but are still included within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a CCP protein 
can be expressed in plant cells, bacterial cells such as E. coli, insect cells, yeast or
mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other 
suitable host cells are known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated
transfection, lipofection, or electroporation. Suitable methods for transforming or
transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A Laboratory
Manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, NY, 1989), and other laboratory manuals.

Means for introducing a recombinant expression vector of this invention into plant
tissue or cells include, but are not limited to: transformation using CaCl2 and variations
thereof, in particular the method described by Hanahan (J. Mol. Biol. 166, 557-560, 1983),
direct DNA uptake into protoplasts (Krens et al., Nature 296: 72-74, 1982; Paszkowski et
al., EMBO J. 3:2717-2722, 1984), PEG-mediated uptake to protoplasts (Armstrong et al.,
Plant Cell Reports 9: 335-339, 1990), microparticle bombardment, electroporation (Fromm
et al., Proc. Natl. Acad. Sci. (USA) 82:5824-5828, 1985), microinjection of DNA
(Crossway et al., Mol. Gen. Genet. 202:179-185, 1986), microparticle bombardment of
tissue explants or cells (Christou et al., Plant Physiol. 87: 671-674, 1988; Sanford,
Particulate Science and Technology 5: 27-37, 1987), vacuum-infiltration of tissue with
nucleic acid, or in the case of plants, T-DNA-mediated transfer from Agrobacterium to the
plant tissue as described essentially by An et al. (EMBO J. 4:277-284, 1985),
Herrera-Estrella et al. (Nature 303: 209-213, 1983a; EMBO J. 2: 987-995, 1983b; In: Plant Genetic
Engineering, Cambridge University Press, N.Y., pp 63-93, 1985), or in planta method
using Agrobacterium tumefaciens such as that described by Bechtold et al. (C.R. Acad.
Sci. (Paris, Sciences de la vie/Life Sciences) 316: 1194-1199, 1993), Clough et al. (Plant J.
16: 735-743, 1998), Trieu et al. (Plant J. 22:531-541, 2000) or Kloti (WO01/12828,
2001). Methods for transformation of monocotyledonous plants are well known in the art
and include Agrobacterium-mediated transformation (Cheng et al (1997) WO 97/48814;
Hansen (1998) WO 98/54961; Hiei et al (1994) WO 94/00977; Hiei et al (1998) WO
98/17813; Rikiishi et al (1999) WO 99/04618; Saito et al (1995) WO 95/06722),
microprojectile bombardment (Adams et al (1999) US 5,969,213; Bowen et al (1998) US
5,736,369; Chang et al (1994) WO 94/13822; Lundquist et al (1999) US 5,874,265/US
5,990,390; Vasil and Vasil (1995) US 5,405,765; Walker et al (1999) US 5,955,362),
DNA uptake (Eval et al (1993) WO 93/181,168), microinjection of Agrobacterium cells
(von Holt 1994 DE 4309203), sonication (Finer et al (1997) US 5,693,512) and flower-dip
or in planta transformation (Kloti, WO01/12828, 2001).

The vector DNA may further comprise a selectable marker gene to facilitate the
identification and/or selection of cells which are transfected or transformed with a genetic
construct. Suitable selectable marker genes contemplated herein include the ampicillin
resistance (Amp^r), tetracycline resistance gene (Tc^r), bacterial kanamycin resistance gene
(Kan^r), phosphinothricin resistance gene, neomycin phosphotransferase gene (nptII),
hygromycin resistance gene, β-glucuronidase (GUS) gene, chloramphenicol
acetyltransferase (CAT) gene, green fluorescent protein (gfp) gene (Haseloff et al., 1997),
and luciferase gene.

For microparticle bombardment of cells, a microparticle is propelled into a cell to
produce a transformed cell. Any suitable ballistic cell transformation methodology and
apparatus can be used in performing the present invention. Exemplary apparatus and
procedures are disclosed by Stomp et al. (U.S. Patent No. 5,122,466) and Sanford and
Wolf (U.S. Patent No. 4,945,050). When using ballistic transformation procedures, the
gene construct may incorporate a plasmid capable of replicating in the cell to be
transformed. Examples of microparticles suitable for use in such systems include 1 to 5
µm gold spheres. The DNA construct may be deposited on the microparticle by any
suitable technique, such as by precipitation.

A whole plant may be regenerated from the transformed or transfected cell, in 
accordance with procedures well known in the art. Plant tissue capable of subsequent 
clonal propagation, whether by organogenesis or embryogenesis, may be transformed with 
a gene construct of the present invention and a whole plant regenerated therefrom. The 
particular tissue chosen will vary depending on the clonal propagation systems available 
for, and best suited to, the particular species being transformed. Exemplary tissue targets 
include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus 
tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root 
meristems), and induced meristem tissue (e.g., cotyledon meristem and hypocotyl 
meristem). 

The term "organogenesis", as used herein, includes a process by which shoots and 
roots are developed sequentially from meristematic centres. 

The term "embryogenesis", as used herein, includes a process by which shoots and 
roots develop together in a concerted fashion (not sequentially), whether from somatic 
cells or gametes. 

Preferably, the plant is produced according to the methods of the invention by 
transfecting or transforming the plant with a genetic sequence, or by introducing to the 
plant a protein, by any art-recognized means, such as microprojectile bombardment, 
microinjection, Agrobacterium-mediated transformation (including in planta 
transformation), protoplast fusion, or electroporation, amongst others. Most preferably the 
plant is produced by Agrobacterium-mediated transformation. 

Agrobacterium-mediated transformation or agrolistic transformation of plants, yeast, 
moulds or filamentous fungi is based on the transfer of part of the transformation vector 
sequences, called the T-DNA, to the nucleus and on integration of said T-DNA in the 
genome of said eukaryote. 

The term "Agrobacterium" as used herein, includes a member of the
Agrobacteriaceae, more preferably Agrobacterium or Rhizobacterium and most preferably 
Agrobacterium tumefaciens. 

The term "T-DNA", or "transferred DNA", as used herein, includes the
transformation vector flanked by T-DNA borders which is, after activation of the
Agrobacterium vir genes, nicked at the T-DNA borders and is transferred as a single
stranded DNA to the nucleus of a eukaryotic cell.

As used herein, the terms "T-DNA borders", "T-DNA border region", or "border
region" include either right T-DNA borders (RB) or left T-DNA borders (LB), which
comprise a core sequence flanked by a border inner region as part of the T-DNA flanking
the border and/or a border outer region as part of the vector backbone flanking the border.
The core sequences comprise 22 bp in case of octopine-type vectors and 25 bp in case of
nopaline-type vectors. The core sequences in the right border region and left border region
form imperfect repeats.
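
The notion of "imperfect repeats" can be illustrated with a minimal sketch that compares two equal-length core sequences position by position. The two 25 bp sequences below are invented placeholders, not actual border sequences from any Ti plasmid, and the function names are hypothetical.

```python
def mismatch_positions(a: str, b: str) -> list[int]:
    """Return the 0-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i for i, (x, y) in enumerate(zip(a.upper(), b.upper())) if x != y]


def percent_identity(a: str, b: str) -> float:
    """Percentage of positions that are identical between the two sequences."""
    return 100.0 * (len(a) - len(mismatch_positions(a, b))) / len(a)


# Hypothetical 25 bp core sequences, invented for illustration only.
RIGHT_CORE = "ACGTACGTACGTACGTACGTACGTA"
LEFT_CORE = "ACATACGTACGTACGGATGTACGTA"

if __name__ == "__main__":
    # The repeats are "imperfect" when they are similar but not identical.
    print(mismatch_positions(RIGHT_CORE, LEFT_CORE))
    print(percent_identity(RIGHT_CORE, LEFT_CORE))
```

With these placeholder sequences the two cores differ at three of 25 positions, which is the sense in which a repeat is imperfect rather than exact.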

As used herein, the term "T-DNA transformation vector" or "T-DNA vector" 
includes any vector encompassing a T-DNA sequence flanked by a right and left T-DNA 
border consisting of at least the right and left border core sequences, respectively, and used 
for transformation of any eukaryotic cell.

As used herein, the term "T-DNA vector backbone sequence" or "T-DNA vector 
backbone sequences" includes all DNA of a T-DNA containing vector that lies outside of 
the T-DNA borders and, more specifically, outside the nicking sites of the border core 
imperfect repeats. 
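
The distinction between T-DNA and vector backbone can be sketched as a coordinate test on a circular vector. This is only an illustration under stated assumptions: the vector is circular, coordinates are 0-based, and the T-DNA is taken to run from the right border nick site to the left border nick site in the direction of increasing coordinates (with wraparound). The function names and the example coordinates are hypothetical.

```python
def region_of(position: int, rb_nick: int, lb_nick: int, vector_length: int) -> str:
    """Classify a 0-based coordinate on a circular vector as T-DNA or backbone.

    Assumption (not from the source): the T-DNA occupies the coordinates from
    rb_nick up to, but not including, lb_nick, reading in increasing
    coordinate direction and wrapping around the origin if necessary.
    """
    for value in (position, rb_nick, lb_nick):
        if not 0 <= value < vector_length:
            raise ValueError("coordinate outside the vector")
    tdna_span = (lb_nick - rb_nick) % vector_length
    offset = (position - rb_nick) % vector_length
    return "T-DNA" if offset < tdna_span else "backbone"


def backbone_length(rb_nick: int, lb_nick: int, vector_length: int) -> int:
    """Number of base pairs lying outside the two nick sites."""
    return vector_length - (lb_nick - rb_nick) % vector_length


if __name__ == "__main__":
    # Hypothetical 10 kb vector whose T-DNA wraps around the origin.
    print(region_of(9500, rb_nick=9000, lb_nick=3000, vector_length=10000))
    print(region_of(5000, rb_nick=9000, lb_nick=3000, vector_length=10000))
    print(backbone_length(9000, 3000, 10000))
```

Everything that the test classifies as "backbone" corresponds to the sequences whose transfer an optimized T-DNA vector is intended to reduce.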

The present invention includes optimized T-DNA vectors such that vector
backbone integration in the genome of a eukaryotic cell is minimized or absent. The term
"optimized T-DNA vector" as used herein includes a T-DNA vector designed either to
decrease or abolish transfer of vector backbone sequences to the genome of a eukaryotic
cell. Such T-DNA vectors are known to one of skill in the art and include those
described by Hanson et al. (1999) and by Stuiver et al. (1999 - WO9901563).

The current invention clearly considers the inclusion of a DNA sequence encoding
a CCP, homologue, analogue, derivative or immunologically active fragment thereof as
defined supra, in any T-DNA vector comprising binary transformation vectors,
super-binary transformation vectors, co-integrate transformation vectors, Ri-derived
transformation vectors as well as in T-DNA carrying vectors used in agrolistic
transformation.

As used herein, the term "binary transformation vector" includes a T-DNA 
transformation vector comprising: a T-DNA region comprising at least one gene of interest 
and/or at least one selectable marker active in the eukaryotic cell to be transformed; and 
a vector backbone region comprising at least origins of replication active in E. coli and
Agrobacterium and markers for selection in E. coli and Agrobacterium. Alternatively, 
replication of the binary transformation vector in Agrobacterium is dependent on the 
presence of a separate helper plasmid. The binary vector pGreen and the helper plasmid
pSoup form an example of such a system (Hellens et al. (2000), Plant Mol. Biol. 42, 819-
832; http://www.pgreen.ac.uk). 

The T-DNA borders of a binary transformation vector can be derived from 
octopine-type or nopaline-type Ti plasmids or from both. The T-DNA of a binary vector is 
only transferred to a eukaryotic cell in conjunction with a helper plasmid. As used herein, 
the term "helper plasmid" includes a plasmid that is stably maintained in Agrobacterium
and is at least carrying the set of vir genes necessary for enabling transfer of the T-DNA. 
The set of vir genes can be derived from either octopine-type or nopaline-type Ti plasmids 
or from both. 

As used herein, the term "super-binary transformation vector" includes a binary 
transformation vector additionally carrying in the vector backbone region a vir region of 
the Ti plasmid pTiBo542 of the super-virulent A. tumefaciens strain A281 (EP0604662,
EP0687730). Super-binary transformation vectors are used in conjunction with a helper 
plasmid. 

As used herein, the term "co-integrate transformation vector" includes a T-DNA 
vector at least comprising: a T-DNA region comprising at least one gene of interest and/or 
at least one selectable marker active in plants; and a vector backbone region comprising at 
least origins of replication active in Escherichia coli and Agrobacterium, and markers for 
selection in E. coli and Agrobacterium, and a set of vir genes necessary for enabling 
transfer of the T-DNA. The T-DNA borders and the set of vir genes of the T-DNA vector 
can be derived from either octopine-type or nopaline-type Ti plasmids or from both. 

The term "Ri-derived plant transformation vector" includes a binary transformation 
vector in which the T-DNA borders are derived from a Ti plasmid and the binary 
transformation vector being used in conjunction with a "helper" Ri-plasmid carrying the
necessary set of vir genes. 

The terms "agrolistics", "agrolistic transformation" or "agrolistic transfer" include a 
transformation method combining features of Agrobacterium-mediated transformation and 
of biolistic DNA delivery. As such, a T-DNA containing target plasmid is co-delivered 
with DNA/RNA enabling in planta production of VirD1 and VirD2 with or without VirE2
(Hansen and Chilton 1996; Hansen et al. 1997; Hansen and Chilton 1997 - WO9712046). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 
can be used to produce (i.e., express) a CCP protein. Accordingly, the invention further
provides methods for producing a CCP protein using the host cells of the invention. In one
embodiment, the method comprises culturing the host cell of the invention (into which a
recombinant expression vector encoding a CCP protein has been introduced) in a suitable 
medium such that a CCP protein is produced. In another embodiment, the method further 
comprises isolating a CCP protein from the medium or the host cell. 

The host cells of the invention can also be used to produce transgenic plants or non-human
transgenic animals in which exogenous CCP sequences have been introduced into
their genome or homologous recombinant plants or animals in which endogenous CCP 
sequences have been altered. Such plants and animals are useful for studying the function 
and/or activity of a CCP and for identifying and/or evaluating modulators of CCP activity. 

Transgenic Plants

As used herein, "transgenic plant" includes a plant which comprises within its 
genome a heterologous polynucleotide. Generally, the heterologous polynucleotide is 
stably integrated within the genome such that the polynucleotide is passed on to successive 
generations. The heterologous polynucleotide may be integrated into the genome alone
or as part of a recombinant expression cassette. "Transgenic" is used herein to include any
cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by
the presence of heterologous nucleic acid including those transgenics initially so altered as
well as those created by sexual crosses or asexual propagation from the initial transgenic.
The term "transgenic" as used herein does not encompass the alteration of the genome
(chromosomal or extra-chromosomal) by conventional plant breeding methods or by
naturally occurring events such as random cross-fertilization, non-recombinant viral
infection, non-recombinant bacterial transformation, non-recombinant transposition, or 
spontaneous mutation. 

A transgenic plant of the invention can be created by introducing a CCP-encoding 
nucleic acid into the plant by placing it under the control of regulatory elements which 
ensure the expression in plant cells. These regulatory elements may be heterologous or 
homologous with respect to the nucleic acid molecule to be expressed as well as with respect
to the plant species to be transformed. In general, such regulatory elements comprise a
promoter active in plant cells. These promoters can be used to modulate (e.g., increase or
decrease) CCP content and/or composition in a desired tissue. To obtain expression in all
tissues of a transgenic plant, preferably constitutive promoters are used, such as the 35S
promoter of CaMV (Odell, Nature 313 (1985), 810-812) or promoters from such genes as
rice actin (McElroy et al. (1990) Plant Cell 2:163-171), maize H3 histone (Lepetit et al.
(1992) Mol. Gen. Genet. 231:276-285) or promoters of the polyubiquitin genes of maize
(Christensen, Plant Mol. Biol. 18 (1982), 675-689). In order to achieve expression in
specific tissues of a transgenic plant it is possible to use tissue specific promoters (see, e.g., 
Stockhaus, EMBO J. 8 (1989), 2245-2251 or Table II, below). 

Table II:

| GENE SOURCE | EXPRESSION PATTERN | REFERENCE |
|---|---|---|
| α-amylase (Amy32b) | aleurone | Lanahan, M.B., et al., Plant Cell 4: 203-211, 1992; Skriver, K., et al., Proc. Natl. Acad. Sci. (USA) 88: 7266-7270, 1991 |
| cathepsin β-like gene | aleurone | Cejudo, F.J., et al., Plant Molecular Biology 20: 849-856, 1992 |
| Agrobacterium rhizogenes rolB | cambium | Nilsson et al., Physiol. Plant. 100: 456-462, 1997 |
| PRP genes | cell wall | http://salus.medium.edu/mmg/tierney/html |
| barley ltr1 promoter | endosperm | |
| synthetic promoter | endosperm | Vicente-Carbajosa et al., Plant J. 13: 629-640, 1998 |
| AtPRP4 | flowers | http://salus.medium.edu/mmg/tierney/html |
| chalcone synthase (chsA) | flowers | van der Meer, et al., Plant Mol. Biol. 15, 95-109, 1990 |
| LAT52 | anther | Twell et al., Mol. Gen. Genet. 217: 240-245 |
| apetala-3 | flowers | |
| chitinase | fruit (berries, grapes, etc.) | Thomas et al., CSIRO Plant Industry, Urrbrae, South Australia, Australia; http://winetitles.com.au/gwrdc/csh95-1.html |
| rbcs-3A | green tissue (e.g., leaf) | Lam, E. et al., The Plant Cell 2: 857-866, 1990; Tucker et al., Plant Physiol. 113: 1303-1308, 1992 |
| leaf-specific genes | leaf | Baszczynski, et al., Nucl. Acid Res. 16: 4732, 1988 |
| AtPRP4 | leaf | http://salus.medium.edu/mmg/tierney/html |
| Pinus cab-6 | leaf | Yamamoto et al., Plant Cell Physiol. 35: 773-778, 1994 |
| SAM22 | senescent leaf | Crowell, et al., Plant Mol. Biol. 18: 459-466, 1992 |
| R. japonicum nif gene | nodule | United States Patent No. 4,803,165 |
| B. japonicum nifH gene | nodule | United States Patent No. 5,008,194 |
| GmENOD40 | nodule | Yang, et al., The Plant J. 3: 573-585 |
| PEP carboxylase (PEPC) | nodule | Pathirana, et al., Plant Mol. Biol. 20: 437-450, 1992 |
| leghaemoglobin (Lb) | nodule | Gordon, et al., J. Exp. Bot. 44: 1453-1465, 1993 |
| Tungro bacilliform virus gene | phloem | Bhattacharyya-Pakrasi, et al., The Plant J. 4: 71-79, 1992 |
| sucrose-binding protein gene | plasma membrane | Grimes, et al., The Plant Cell 4: 1561-1574, 1992 |
| pollen-specific genes | pollen; microspore | Albani, et al., Plant Mol. Biol. 15: 605, 1990; Albani, et al., Plant Mol. Biol. 16: 501, 1991 |
| [illegible] | | Guerrero et al., Mol. Gen. Genet. 224: 161-168 (1993) |
| apg gene | microspore | Twell et al., Sex. Plant Reprod. 6: 217-224 |
| maize pollen-specific gene | pollen | Hamilton, et al., Plant Mol. Biol. 18: 211-218, 1992 |
| sunflower pollen-expressed gene | pollen | Baltz, et al., The Plant J. 2: 713-721, 1992 |
| B. napus pollen-specific gene | pollen; anther; tapetum | Arnoldo, et al., J. Cell. Biochem., Abstract No. Y101, 204, 1992 |
| root-expressible genes | roots | Tingey, et al., EMBO J. 6: 1, 1987 |
| tobacco auxin-inducible gene | root tip | Van der Zaal, et al., Plant Mol. Biol. 16, 983, 1991 |
| β-tubulin | root | Oppenheimer, et al., Gene 63: 87, 1988 |
| tobacco root-specific genes | root | Conkling, et al., Plant Physiol. 93: 1203, 1990 |
| B. napus G1-3b gene | root | United States Patent No. 5,401,836 |
| SbPRP1 | roots | Suzuki et al., Plant Mol. Biol. 21: 109- |
| AtPRP1; AtPRP3 | roots; root hairs | http://salus.medium.edu/mmg/tierney/html |
| RD2 gene | root cortex | http://www2.cnsu.edu/ncsu/research |
| TobRB7 gene | root vasculature | http://www2.cnsu.edu/ncsu/research |
| AtPRP4 | leaves; flowers; lateral root primordia | http://salus.medium.edu/mmg/tierney/html |
| seed-specific genes | seed | Simon, et al., Plant Mol. Biol. 5: 191, 1985; Scofield, et al., J. Biol. Chem. 262: 12202, 1987; Baszczynski, et al., Plant Mol. Biol. 14: 633, 1990 |
| Brazil Nut albumin | seed | Pearson, et al., Plant Mol. Biol. 18: 235-245, 1992 |
| legumin | seed | Ellis, et al., Plant Mol. Biol. 10: 203-214, 1988 |
| glutelin (rice) | seed | Takaiwa, et al., Mol. Gen. Genet. 208: 15-22, 1986; Takaiwa, et al., FEBS Letts. 221: 43-47, 1987 |
| zein | seed | Matzke et al., Plant Mol. Biol. 14(3): 323-32, 1990 |
| napA | seed | Stalberg, et al., Planta 199: 515-519, 1996 |
| sunflower oleosin | seed (embryo and dry seed) | Cummins, et al., Plant Mol. Biol. 19: 873-876, 1992 |
| LEAFY | shoot meristem | Weigel et al., Cell 69: 843-859, 1992 |
| Arabidopsis thaliana knat1 | shoot meristem | Accession number AJ131822 |
| Malus domestica kn1 | shoot meristem | Accession number Z71981 |
| CLAVATA1 | shoot meristem | Accession number AF049870 |
| stigma-specific genes | stigma | Nasrallah, et al., Proc. Natl. Acad. Sci. USA 85: 5551, 1988; Trick, et al., Plant Mol. Biol. 15: 203, 1990 |
| class I patatin gene | tuber | Liu et al., Plant Mol. Biol. 153: 386-395, 1991 |
| blz2 | endosperm | EP99106056.7 |
| PCNA rice | meristem | Kosugi et al., Nucleic Acids Research 19: 1571-1576, 1991; Kosugi S. and Ohashi Y., Plant Cell 9: 1607-1619, 1997 |



The promoters listed in the foregoing table are provided for the purposes of 
exemplification only and the present invention is not to be limited by the list provided 
therein. Those skilled in the art will readily be in a position to provide additional promoters 
that are useful in performing the present invention. The promoters listed may also be
modified to provide specificity of expression as required. 

Promoters are also known which are specifically active in tubers of potatoes or in
seeds of different plant species, such as maize, Vicia, wheat, barley and the like. Inducible
promoters may be used in order to be able to exactly control expression under certain
environmental or developmental conditions such as pathogens, anaerobia, or light. Examples
of inducible promoters include the promoters of genes encoding heat shock proteins or
microspore-specific regulatory elements (WO96/16182). Furthermore, the chemically
inducible Tet-system may be employed (Gatz, Mol. Gen. Genet. 227 (1991), 229-237).
Further suitable promoters are known to the person skilled in the art and are described, e.g.,
in Ward (Plant Mol. Biol. 22 (1993), 361-366). The regulatory elements may further
comprise transcriptional and/or translational enhancers functional in plant cells.
Furthermore, the regulatory elements may include transcription termination signals, such as a
poly-A signal, which lead to the addition of a poly-A tail to the transcript which may improve
its stability.

In the case that a nucleic acid molecule according to the invention is expressed in the
sense orientation, the coding sequence can be modified such that the protein is located in any
desired compartment of the plant cell, e.g., the nucleus, endoplasmic reticulum, the
vacuole, the mitochondria, the plastids, the apoplast, or the cytoplasm.

Methods for the introduction of foreign DNA into plants are also well known in the
art. These include, for example, the transformation of plant cells or tissues with T-DNA
using Agrobacterium tumefaciens or Agrobacterium rhizogenes, the fusion of protoplasts,
direct gene transfer (see, e.g., EP-A 164 575), injection, electroporation, biolistic methods
like particle bombardment, pollen-mediated transformation, plant RNA virus-mediated
transformation, liposome-mediated transformation, transformation using wounded or
enzyme-degraded immature embryos, or wounded or enzyme-degraded embryogenic callus
and other methods known in the art. The vectors used in the method of the invention may
contain further functional elements, for example "left border"- and "right border"-
sequences of the T-DNA of Agrobacterium which allow for stable integration into the
plant genome. Furthermore, methods and vectors are known to the person skilled in the art
which permit the generation of marker free transgenic plants, i.e., the selectable or scorable
marker gene is lost at a certain stage of plant development or plant breeding. This can be
achieved by, for example, cotransformation (Lyznik, Plant Mol. Biol. 13 (1989), 151-161;
Peng, Plant Mol. Biol. 27 (1995), 91-104) and/or by using systems which utilize enzymes
capable of promoting homologous recombination in plants (see, e.g., WO97/08331;
Bayley, Plant Mol. Biol. 18 (1992), 353-361; Lloyd, Mol. Gen. Genet. 242 (1994), 653-
657; Maeser, Mol. Gen. Genet. 230 (1991), 170-176; Onouchi, Nucl. Acids Res. 19
(1991), 6373-6378). Methods for the preparation of appropriate vectors are described by,
e.g., Sambrook (Molecular Cloning; A Laboratory Manual, 2nd Edition (1989), Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, NY).

Suitable strains of Agrobacterium tumefaciens and vectors, as well as
transformation of Agrobacteria, and appropriate growth and selection media are described
in, for example, GV3101 (pMK90RK), Koncz, Mol. Gen. Genet. 204 (1986), 383-396;
C58C1 (pGV 3850kan), Deblaere, Nucl. Acid Res. 13 (1985), 4777; Bevan, Nucleic. Acid
Res. 12 (1984), 8711; Koncz, Proc. Natl. Acad. Sci. USA 86 (1989), 8467-8471; Koncz,
Plant Mol. Biol. 20 (1992), 963-976; Koncz, Specialized vectors for gene tagging and
expression studies. In: Plant Molecular Biology Manual Vol 2, Gelvin and Schilperoort
(Eds.), Dordrecht, The Netherlands: Kluwer Academic Publ. (1994), 1-22; EP-A-120 516;
Hoekema: The Binary Plant Vector System, Offsetdrukkerij Kanters B.V., Alblasserdam
(1985), Chapter V; Fraley, Crit. Rev. Plant. Sci., 4, 1-46; An, EMBO J. 4 (1985), 277-287.
Although the use of Agrobacterium tumefaciens is preferred in the method of the invention,
other Agrobacterium strains, such as Agrobacterium rhizogenes, may be used, for example, if
a phenotype conferred by said strain is desired.

Methods for the transformation using biolistic methods are known to the person
skilled in the art; see, e.g., Wan, Plant Physiol. 104 (1994), 37-48; Vasil, Bio/Technology 11
(1993), 1553-1558 and Christou (1996) Trends in Plant Science 1, 423-431. Microinjection
can be performed as described in Potrykus and Spangenberg (eds.), Gene Transfer To Plants.
Springer Verlag, Berlin, NY (1995).

The transformation of most dicotyledonous plants may be performed using the 
methods described above or using transformation via biolistic methods as, e.g., described 
above as well as protoplast transformation, electroporation of partially permeabilized cells, or 
introduction of DNA using glass fibers. 

In general, the plants which are modified according to the invention may be derived
from any desired plant species. They can be monocotyledonous plants or dicotyledonous
plants, preferably they belong to plant species of interest in agriculture, wood culture or
horticulture, such as crop plants (e.g., maize, rice, barley, wheat, rye, oats), potatoes,
oil producing plants (e.g., oilseed rape, sunflower, peanut, soybean), cotton, sugar beet,
sugar cane, leguminous plants (e.g., beans, peas), or wood producing plants, preferably trees.

The present invention also relates to a transgenic plant cell which contains (preferably 
stably integrated into its genome) a nucleic acid molecule of the present invention linked to 
regulatory elements which allow expression of the nucleic acid molecule in plant cells. The 
presence and expression of the nucleic acid molecule in the transgenic plant cells leads to the 
synthesis of a CCP protein and may lead to physiological and phenotypic changes in plants
containing such cells. 

Transformed plant cells which are derived by any of the above transformation 
techniques can be cultured to regenerate a whole plant which possesses the transformed 
genotype. Such regeneration techniques often rely on manipulation of certain 
phytohormones in a tissue culture growth medium, typically relying on a biocide and/or
herbicide marker which has been introduced with a polynucleotide of the present 
invention. 

Plant cells transformed with a plant expression vector can be regenerated, e.g., from
single cells, callus tissue or leaf discs according to standard plant tissue culture techniques.
It is well known in the art that various cells, tissues, and organs from almost any plant can
be successfully cultured to regenerate an entire plant. Plant regeneration from cultured
protoplasts is described in Evans et al., Protoplasts Isolation and Culture, Handbook of
Plant Cell Culture, Macmillan Publishing Company, New York, pp. 124-176 (1983); and
Binding, Regeneration of Plants, Plant Protoplasts, CRC Press, Boca Raton, pp. 21-73
(1985).

Transformed plant cells, calli or explants can be cultured on regeneration medium in
the dark for several weeks, generally about 1 to 3 weeks, to allow the somatic embryos to
mature. Preferred regeneration media include media containing MS salts, such as PHI-E
and PHI-F media. The plant cells, calli or explants are then typically cultured on rooting
medium in a light/dark cycle until shoots and roots develop. Methods for plant
regeneration are known in the art and preferred methods are provided by Kamo et al. (Bot.
Gaz. 146(3):324-334, 1985), West et al. (The Plant Cell 5:1361-1369, 1993), and Duncan
et al. (Planta 165:322-332, 1985).

Small plantlets can then be transferred to tubes containing rooting medium and 
allowed to grow and develop more roots for approximately another week. The plants can 
then be transplanted to soil mixture in pots in the greenhouse. 

The regeneration of plants containing the foreign gene introduced by
Agrobacterium from leaf explants can be achieved as described by Horsch et al., Science,
227:1229-1231 (1985). In this procedure, transformants are grown in the presence of a
selection agent and in a medium that induces the regeneration of shoots in the plant species
being transformed as described by Fraley et al., Proc. Natl. Acad. Sci. U.S.A. 80:4803
(1983). This procedure typically produces shoots within two to four weeks and these
transformant shoots are then transferred to an appropriate root-inducing medium
containing the selective agent and an antibiotic to prevent bacterial growth. Transgenic
plants of the present invention may be fertile or sterile.

Regeneration can also be obtained from plant callus, explants, organs, or parts
thereof. Such regeneration techniques are described generally in Klee et al., Ann. Rev. of
Plant Phys., 38:467-486 (1987). The regeneration of plants from either single plant
protoplasts or various explants is well known in the art. See, for example, Methods for
Plant Molecular Biology, A. Weissbach and H. Weissbach, eds., Academic Press, Inc., San
Diego, Calif. (1988). This regeneration and growth process includes the steps of selection
of transformant cells and shoots, rooting the transformant shoots and growth of the plantlets
in soil. For maize cell culture and regeneration see generally, The Maize Handbook,
Freeling and Walbot, Eds., Springer, New York (1994); Corn and Corn Improvement, 3rd
edition, Sprague and Dudley Eds., American Society of Agronomy, Madison, Wisconsin
(1988).

One of skill will recognize that after the recombinant expression cassette is stably 
incorporated in transgenic plants and confirmed to be operable, it can be introduced into
other plants by sexual crossing. Any of a number of standard breeding techniques can be 
used, depending upon the species to be crossed. 

In vegetatively propagated crops, mature transgenic plants can be propagated by the 
taking of cuttings or by tissue culture techniques to produce multiple identical plants. 
Selection of desirable transgenics is made and new varieties are obtained and propagated
vegetatively for commercial use. In seed propagated crops, mature transgenic plants can
be self crossed to produce a homozygous inbred plant. The inbred plant produces seed
containing the newly introduced heterologous nucleic acid. These seeds can be grown to
produce plants that would produce the selected phenotype (e.g., altered cell cycle content
or composition). 

Parts obtained from the regenerated plant, such as flowers, seeds, leaves, branches, 
fruit and the like are included in the invention, provided that these parts comprise cells 
comprising the isolated nucleic acid of the present invention. Progeny and variants, and 
mutants of the regenerated plants are also included within the scope of the invention, 
provided that these parts comprise the introduced nucleic acid sequences. 

Transgenic plants expressing the selectable marker can be screened for 
transmission of the nucleic acid of the present invention by, for example, standard 
immunoblot and DNA detection techniques. Transgenic lines are also typically evaluated 
on levels of expression of the heterologous nucleic acid. Expression at the RNA level can 
be determined initially to identify and quantitate expression-positive plants. Standard 
techniques for RNA analysis can be employed and include PCR amplification assays using 
oligonucleotide primers designed to amplify only the heterologous RNA templates and 
solution hybridization assays using heterologous nucleic acid-specific probes. The
RNA-positive plants can then be analyzed for protein expression by Western immunoblot analysis
using the specifically reactive antibodies of the present invention. In addition, in situ
hybridization and immunocytochemistry according to standard protocols can be done using
heterologous nucleic acid specific polynucleotide probes and antibodies, respectively, to 
localize sites of expression within transgenic tissue. Generally, a number of transgenic 
lines are usually screened for the incorporated nucleic acid to identify and select plants 
with the most appropriate expression profiles. 

A preferred embodiment of the invention is a transgenic plant that is homozygous 
for the added heterologous nucleic acid; i.e., a transgenic plant that contains two added
nucleic acid sequences, one gene at the same locus on each chromosome of a chromosome 
pair. A homozygous transgenic plant can be obtained by sexually mating (selfing) a 
heterozygous transgenic plant that contains a single added heterologous nucleic acid, 
germinating some of the seed produced and analyzing the resulting plants produced for 
altered cell division relative to a control plant (i.e., native, non-transgenic). Back-crossing 
to a parental plant and out-crossing with a non-transgenic plant are also contemplated. 
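
The expected outcome of selfing a plant that carries a single added nucleic acid can be worked through as a small calculation. This is a sketch under simplifying assumptions that are not stated in the text: one transgene locus, Mendelian segregation, and no selection acting on gametes or zygotes. The function name is hypothetical.

```python
from fractions import Fraction
from itertools import product


def selfing_genotype_fractions() -> dict[int, Fraction]:
    """Expected progeny fractions by transgene copy number (0, 1 or 2).

    Assumes a parent carrying the transgene at a single locus on one
    chromosome of a pair, so each gamete carries it with probability 1/2.
    """
    gametes = {"T": Fraction(1, 2), "-": Fraction(1, 2)}
    fractions = {0: Fraction(0), 1: Fraction(0), 2: Fraction(0)}
    # Combine every possible egg with every possible pollen gamete.
    for (egg, p_egg), (pollen, p_pollen) in product(gametes.items(), repeat=2):
        copies = (egg == "T") + (pollen == "T")
        fractions[copies] += p_egg * p_pollen
    return fractions


if __name__ == "__main__":
    result = selfing_genotype_fractions()
    for copies, fraction in result.items():
        print(copies, fraction)
    # Share of transgene-carrying progeny that are homozygous.
    print(result[2] / (result[1] + result[2]))
```

Under these assumptions about one quarter of the progeny would be homozygous for the added nucleic acid, which is why only some of the germinated seed needs to be analyzed to find such plants.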

The present invention also relates to transgenic plants and plant tissue comprising 
transgenic plant cells according to the invention. Due to the (over)expression of a CCP 
molecule, e.g., at developmental stages and/or in plant tissue in which they do not naturally 
occur, these transgenic plants may show various physiological, developmental and/or 
morphological modifications in comparison to wild-type plants. 

Therefore, part of this invention is the use of the CCP molecules to modulate the 
cell cycle and/or plant cell division and/or growth in plant cells, plant tissues, plant organs 
and/or whole plants. Also within the scope of the invention is a method for influencing
the activity of CDKs such as CDC2a, or CDC2b, CKSs, CKIs, PLPs and KLPNTs in a
plant cell by transforming the plant cell with a nucleic acid molecule according to the 
invention and/or manipulation of the expression of the molecule. 

Furthermore, the invention also relates to a transgenic plant cell which contains 
(preferably stably integrated into its genome) a nucleic acid molecule of the invention or
part thereof, wherein the transcription and/or expression of the nucleic acid molecule or
part thereof leads to reduction of the synthesis of a CCP. In a preferred embodiment, the
reduction is achieved by an anti-sense, sense, ribozyme, co-suppression and/or dominant
mutant effect. The reduction of the synthesis of a protein according to the invention in the
transgenic plant cells can result in an alteration in, e.g., cell division. In transgenic plants
comprising such cells this can lead to various physiological, developmental and/or 
morphological changes. 

In yet another aspect, the invention relates to harvestable parts and to propagation 
material of the transgenic plants of the invention which either contain transgenic plant cells
expressing a nucleic acid molecule according to the invention or which contain cells which
show a reduced level of the described protein. Harvestable parts can be in principle any 
useful parts of a plant, for example, flowers, pollen, seedlings, tubers, leaves, stems, fruit, 
seeds, roots etc. Propagation material includes, for example, seeds, fruits, cuttings, 
seedlings, tubers, rootstocks, and the like. 


Transgenic Animals

As used herein, a "transgenic animal" is a non-human animal, preferably a 
mammal, more preferably a rodent such as a rat or mouse, in which one or more of the 
cells of the animal includes a transgene. Other examples of transgenic animals include
non-human primates, sheep, dogs, cows, goats, chickens, amphibians, and the like. A
transgene is exogenous DNA which is integrated into the genome of a cell from which a
transgenic animal develops and which remains in the genome of the mature animal,
thereby directing the expression of an encoded gene product in one or more cell types or
tissues of the transgenic animal. As used herein, a "homologous recombinant animal" is a
non-human animal, preferably a mammal, more preferably a mouse, in which an
endogenous CCP gene has been altered by homologous recombination between the 
endogenous gene and an exogenous DNA molecule introduced into a cell of the animal, 
e.g., an embryonic cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing a CCP-encoding
nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral
infection, and allowing the oocyte to develop in a pseudopregnant female foster animal.
The CCP cDNA sequence of SEQ ID NO: 1-66 or 228-239 can be introduced as a
transgene into the genome of a non-human animal. Alternatively, a nonhuman homologue
of a human CCP gene, such as a mouse or rat CCP gene, can be used as a transgene.
Alternatively, a CCP gene homologue, such as another CCP family member, can be
isolated based on hybridization to the CCP cDNA sequences of SEQ ID NO: 1-66 or 228-
239 (described further in subsection I above) and used as a transgene. Intronic sequences
and polyadenylation signals can also be included in the transgene to increase the efficiency
of expression of the transgene. A tissue-specific regulatory sequence(s) can be operably
linked to a CCP transgene to direct expression of a CCP protein to particular cells.
Methods for generating transgenic animals via embryo manipulation and microinjection,
particularly animals such as mice, have become conventional in the art and are described,
for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et al., U.S.
Patent No. 4,873,191 by Wagner et al. and in Hogan, B., Manipulating the Mouse Embryo,
(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Similar methods
are used for production of other transgenic animals. A transgenic founder animal can be
identified based upon the presence of a CCP transgene in its genome and/or expression of
CCP mRNA in tissues or cells of the animals. A transgenic founder animal can then be
used to breed additional animals carrying the transgene. Moreover, transgenic animals 
carrying a transgene encoding a CCP protein can further be bred to other transgenic 
animals carrying other transgenes. 

V. Agricultural, Phytopharmaceutical and Pharmaceutical Compositions

The CCP nucleic acid molecules, CCP proteins, and anti-CCP antibodies (also
referred to herein as "active compounds") of the invention can be incorporated into
compositions useful in agriculture and in plant cell and tissue culture. Plant protection
compositions can be prepared by conventional means commonly used for the application
of, for example, herbicides and pesticides. For example, certain additives known to those
skilled in the art, such as stabilizers or substances which facilitate the uptake by the plant
cell, plant tissue or plant, may be used.

The CCP nucleic acid molecules, CCP proteins, and anti-CCP antibodies (also
referred to herein as "active compounds") of the invention can also be incorporated into
pharmaceutical compositions suitable for administration into animals. Such compositions
typically comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically
acceptable carrier. As used herein the language "pharmaceutically acceptable carrier" is
intended to include any and all solvents, dispersion media, coatings, antibacterial and
antifungal agents, isotonic and absorption delaying agents, and the like, compatible with
pharmaceutical administration. The use of such media and agents for pharmaceutically
active substances is well known in the art. Except insofar as any conventional media or
agent is incompatible with the active compound, use thereof in the compositions is
contemplated. Supplementary active compounds can also be incorporated into the
compositions.

The nucleic acid molecules of the invention can be inserted into vectors and used as
gene therapy vectors. Gene therapy vectors can be delivered to a plant or subject by, for
example, injection, local administration (see U.S. Patent 5,328,470) or by stereotactic
injection (see e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA 91:3054-3057). The
agricultural or pharmaceutical preparation of the gene therapy vector can include the gene
therapy vector in an acceptable diluent, or can comprise a slow release matrix in which the
gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector
can be produced intact from recombinant cells, e.g., retroviral vectors, the agricultural or
pharmaceutical preparation can include one or more cells which produce the gene delivery
system.

The agricultural and pharmaceutical compositions can be included in a container, 
pack, or dispenser together with instructions for administration. 

VI. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues, and antibodies described
herein can be used in one or more of the following methods: a) agricultural uses (e.g., to
increase plant yield and to develop phytopharmaceuticals); b) screening assays; c)
predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring clinical trials);
d) methods of treatment (e.g., phytotherapeutic, therapeutic and prophylactic); e)
transcriptomics; f) proteomics; g) metabolomics; h) ligandomics; and i) pharmacogenetics
or pharmacogenomics. The isolated nucleic acid molecules of the invention can be used,
for example, to express CCP protein (e.g., via a recombinant expression vector in a host
cell or in gene therapy applications), to detect CCP mRNA (e.g., in a biological sample) or
a genetic alteration in a CCP gene, and to modulate CCP activity, as described further
below. The CCP proteins can be used to treat disorders characterized by insufficient or
excessive production of a CCP substrate or production of CCP inhibitors. In addition, the
CCP proteins can be used to screen for naturally occurring CCP substrates, to screen for
drugs or compounds which modulate CCP activity, as well as to treat disorders
characterized by insufficient or excessive production of CCP protein or production of CCP
protein forms which have decreased or aberrant activity compared to CCP wild type
protein. Moreover, the anti-CCP antibodies of the invention can be used to detect and
isolate CCP proteins, regulate the bioavailability of CCP proteins, and modulate CCP
activity.

A. Agricultural Uses:

In another embodiment of the invention, a method is provided for modifying cell
fate and/or plant development and/or plant morphology and/or biochemistry and/or
physiology comprising the modification of expression in particular cells, tissues or organs
of a plant, of a genetic sequence encoding a CCP, e.g., a CCP operably connected with a
plant-operable promoter sequence.

Modulation of the expression in a plant of a CCP or a homologue, analogue or
derivative thereof as defined in the present invention can produce a range of desirable
phenotypes in plants, such as, for example, the modification of one or more morphological,
biochemical, or physiological characteristics including: (i) modification of the length of the
G1 and/or the S and/or the G2 and/or the M phase of the cell cycle of a plant; (ii)
modification of the G1/S and/or S/G2 and/or G2/M and/or M/G1 phase transition of a plant
cell; (iii) modification of the initiation, promotion, stimulation or enhancement of cell
division; (iv) modification of the initiation, promotion, stimulation or enhancement of
DNA replication; (v) modification of the initiation, promotion, stimulation or enhancement
of seed set and/or seed size and/or seed development; (vi) modification of the initiation,
promotion, stimulation or enhancement of tuber formation; (vii) modification of the
initiation, promotion, stimulation or enhancement of fruit formation; (viii) modification of
the initiation, promotion, stimulation or enhancement of leaf formation; (ix) modification
of the initiation, promotion, stimulation or enhancement of shoot initiation and/or
development; (x) modification of the initiation, promotion, stimulation or enhancement of
root initiation and/or development; (xi) modification of the initiation, promotion,
stimulation or enhancement of lateral root initiation and/or development; (xii) modification
of the initiation, promotion, stimulation or enhancement of nodule formation and/or nodule
function; (xiii) modification of the initiation, promotion, stimulation or enhancement of the
bushiness of the plant; (xiv) modification of the initiation, promotion, stimulation or
enhancement of dwarfism in the plant; (xv) modification of the initiation, promotion,
stimulation or enhancement of senescence; (xvi) modification of stem thickness and/or
strength characteristics and/or wind-resistance of the stem and/or stem length; (xvii)
modification of tolerance and/or resistance to biotic stresses such as pathogen infection;
and (xviii) modification of tolerance and/or resistance to abiotic stresses such as drought
stress or salt stress.

Methods to effect expression of a CCP or a homologue, analogue or derivative
thereof as defined in the present invention in a plant cell, tissue or organ, include either the
introduction of the protein directly to a cell, tissue or organ such as by microinjection or
ballistic means or, alternatively, introduction of an isolated nucleic acid molecule encoding
the protein into the cell, tissue or organ in an expressible format. Methods to effect
expression of a CCP or a homologue, analogue or derivative thereof as defined in the
current invention in whole plants include regeneration of whole plants from the
transformed cells in which an isolated nucleic acid molecule encoding the protein was
introduced in an expressible format.

The present invention clearly extends to any plant produced by the inventive
method described herein, and any and all plant parts and propagules thereof. The present
invention extends further to encompass the progeny derived from a primary transformed or
transfected cell, tissue, organ or whole plant that has been produced by the inventive
method, the only requirement being that the progeny exhibits the same genotypic and/or
phenotypic characteristic(s) as those characteristic(s) that has (have) been produced in the
parent by the performance of the inventive method.

By "cell fate and/or plant development and/or plant morphology and/or
biochemistry and/or physiology" is meant that one or more developmental and/or
morphological and/or biochemical and/or physiological characteristics of a plant is altered
by the performance of one or more steps pertaining to the invention described herein.
"Cell fate" includes the cell-type or cellular characteristics of a particular cell that are
produced during plant development or a cellular process therefor, in particular during the
cell cycle or as a consequence of a cell cycle process.

The term "plant development" or the term "plant developmental characteristic" or
similar terms shall, when used herein, be taken to mean any cellular process of a plant that
is involved in determining the developmental fate of a plant cell, in particular the specific
tissue or organ type into which a progenitor cell will develop. Cellular processes relevant
to plant development will be known to those skilled in the art. Such processes include, for
example, morphogenesis, photomorphogenesis, shoot development, root development,
vegetative development, reproductive development, stem elongation, flowering, and
regulatory mechanisms involved in determining cell fate, in particular a process or
regulatory process involving the cell cycle.

The term "plant morphology" or the term "plant morphological characteristic" or
similar term will, when used herein, be understood by those skilled in the art to include the
external appearance of a plant, including any one or more structural features or
combination of structural features thereof. Such structural features include the shape, size,
number, position, color, texture, arrangement, and patternation of any cell, tissue or organ
or groups of cells, tissues or organs of a plant, including the root, stem, leaf, shoot, petiole,
trichome, flower, petal, stigma, style, stamen, pollen, ovule, seed, embryo, endosperm,
seed coat, aleurone, fibre, fruit, cambium, wood, heartwood, parenchyma, aerenchyma,
sieve element, phloem or vascular tissue.

The term "plant biochemistry" or the term "plant biochemical characteristic" or
similar term will, when used herein, be understood by those skilled in the art to include the
metabolic and catalytic processes of a plant, including primary and secondary metabolism
and the products thereof, including any small molecules, macromolecules or chemical
compounds, such as but not limited to starches, sugars, proteins, peptides, enzymes,
hormones, growth factors, nucleic acid molecules, celluloses, hemicelluloses, calloses,
lectins, fibres, pigments such as anthocyanins, vitamins, minerals, micronutrients, or
macronutrients, that are produced by plants.

The term "plant physiology" or the term "plant physiological characteristic" or
similar term will, when used herein, be understood to include the functional processes of a
plant, including developmental processes such as growth, expansion and differentiation,
sexual development, sexual reproduction, seed set, seed development, grain filling, asexual
reproduction, cell division, dormancy, germination, light adaptation, photosynthesis, leaf
expansion, fibre production, secondary growth or wood production, amongst others;
responses of a plant to externally-applied factors such as metals, chemicals, hormones,
growth factors, environment and environmental stress factors (e.g., anoxia, hypoxia, high
temperature, low temperature, dehydration, light, daylength, flooding, salt, heavy metals,
amongst others), including adaptive responses of plants to said externally-applied factors.

The CCP molecules of the present invention are useful in agriculture. The nucleic
acid molecules, proteins, protein homologues, and antibodies described herein can be used
to modulate the protein levels or activity of a protein involved in the cell cycle, e.g.,
proteins involved in the G1/S and/or the G2/M transition in the cell cycle due to
environmental conditions, including abiotic stress such as cold, nutrient deprivation, heat,
drought, salt stress, or biotic stress such as a pathogen attack.

Thus, the CCP molecules of the present invention may be used to modulate, e.g.,
enhance, crop yields; modulate, e.g., attenuate, stress, e.g., heat or nutrient deprivation;
modulate tolerance to pests and diseases; modulate plant architecture; modulate plant
quality traits; or modulate plant reproduction and seed development.

The CCP molecules of the present invention may also be used to modulate
endoreduplication in storage cells, storage tissues, and/or storage organs of plants or parts
thereof. The term "endoreduplication" includes recurrent DNA replication without
consequent mitosis and cytokinesis. Preferred target storage organs and parts thereof for
the modulation of endoreduplication are, for example, seeds (such as from cereals, oilseed
crops), roots (such as in sugar beet), tubers (such as in potatoes) and fruits (such as in
vegetables and fruit species). Increased endoreduplication in storage organs, and parts
thereof, correlates with enhanced storage capacity and, thus, with improved yield. In
another embodiment of the invention, the endoreduplication of a whole plant is modulated.
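
Because endoreduplication is DNA replication without an intervening mitosis, each completed endocycle doubles the nuclear DNA content of the cell. The following minimal sketch illustrates that arithmetic only; the function name and the default starting content of 2C (a diploid cell in G1) are illustrative assumptions and are not taken from the specification.

```python
def dna_content_c(endocycles, starting_c=2):
    """Nuclear DNA content (in C units) after a number of endocycles.

    Assumption for illustration: the cell starts at `starting_c` (2C for a
    diploid G1 nucleus) and every endocycle is one full round of DNA
    replication that is not followed by mitosis or cytokinesis.
    """
    if endocycles < 0:
        raise ValueError("endocycles must be zero or positive")
    return starting_c * 2 ** endocycles


# Ploidy series for zero to four endocycles: 2C, 4C, 8C, 16C, 32C
series = [dna_content_c(n) for n in range(5)]
```

Under these assumptions a storage cell that has passed through three endocycles carries a 16C nucleus, which is the kind of increase the passage above associates with enhanced storage capacity.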

B. Screening Assays:

The invention provides a method (also referred to herein as a "screening assay") for
identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides,
peptidomimetics, small molecules or other drugs) which bind to CCP proteins, have a
stimulatory or inhibitory effect on, for example, CCP expression or CCP activity, or have a
stimulatory or inhibitory effect on, for example, the expression or activity of a CCP
substrate.

In one embodiment, the invention provides assays for screening candidate or test
compounds which are substrates of a CCP protein or polypeptide or biologically active
portion thereof. In another embodiment, the invention provides assays for screening
candidate or test compounds which bind to or modulate the activity of a CCP protein or
polypeptide or biologically active portion thereof, e.g., modulate the ability of CCP to
interact with its cognate ligand. The test compounds of the present invention can be
obtained using any of the numerous approaches in combinatorial library methods known in
the art, including: biological libraries; spatially addressable parallel solid phase or solution
phase libraries; synthetic library methods requiring deconvolution; the 'one-bead
one-compound' library method; and synthetic library methods using affinity chromatography
selection. The biological library approach is limited to peptide libraries, while the other
four approaches are applicable to peptide, non-peptide oligomer or small molecule libraries
of compounds (Lam, K.S. (1997) Anticancer Drug Des. 12:145).

Examples of methods for the synthesis of molecular libraries can be found in the
art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et al.
(1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med. Chem.
37:2678; Cho et al. (1993) Science 261:1303; Carell et al. (1994) Angew. Chem. Int. Ed.
Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in Gallop et
al. (1994) J. Med. Chem. 37:1233.

Libraries of compounds may be presented in solution (e.g., Houghten (1992)
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP '409),
plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage (Scott and
Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406); (Cwirla et al.
(1990) Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J. Mol. Biol. 222:301-310);
(Ladner supra.).

In another embodiment, an assay is a cell-based assay comprising contacting a cell
expressing a CCP target molecule (e.g., a plant cyclin dependent kinase) with a test
compound and determining the ability of the test compound to modulate (e.g., stimulate or
inhibit) the activity of the CCP target molecule. Determining the ability of the test
compound to modulate the activity of a CCP target molecule can be accomplished, for
example, by determining the ability of the CCP protein to bind to or interact with the CCP
target molecule, or by determining the ability of the target molecule, e.g., the plant cyclin
dependent kinase, to phosphorylate a protein.

The ability of the target molecule, e.g., the plant cyclin dependent kinase, to
phosphorylate a protein can be determined by, for example, an in vitro kinase assay.
Briefly, a protein can be incubated with the target molecule, e.g., the plant cyclin
dependent kinase, and radioactive ATP, e.g., [γ-32P]ATP, in a buffer containing MgCl2
and MnCl2, e.g., 10 mM MgCl2 and 5 mM MnCl2. Following the incubation, the
immunoprecipitated protein can be separated by SDS-polyacrylamide gel electrophoresis
under reducing conditions, transferred to a membrane, e.g., a PVDF membrane, and
autoradiographed. The appearance of detectable bands on the autoradiograph indicates that
the protein has been phosphorylated. Phosphoaminoacid analysis of the phosphorylated
substrate can also be performed in order to determine which residues on the protein are
phosphorylated. Briefly, the radiophosphorylated protein band can be excised from the
SDS gel and subjected to partial acid hydrolysis. The products can then be separated by
one-dimensional electrophoresis and analyzed on, for example, a phosphoimager and
compared to ninhydrin-stained phosphoaminoacid standards.
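
The buffer concentrations given for the in vitro kinase assay (10 mM MgCl2 and 5 mM MnCl2) can be converted into pipetting volumes with the usual dilution relation C1 x V1 = C2 x V2. The sketch below is illustrative only: the 30 µL reaction volume, the 100 mM stock concentrations, and all function names are assumptions chosen for the example and do not appear in the specification.

```python
def stock_volume_ul(final_mm, stock_mm, reaction_volume_ul):
    """Volume of stock solution (in µL) needed to reach `final_mm` in the reaction.

    Uses the dilution relation C1 * V1 = C2 * V2.
    """
    if final_mm > stock_mm:
        raise ValueError("stock must be at least as concentrated as the target")
    return final_mm * reaction_volume_ul / stock_mm


def kinase_buffer_volumes(reaction_volume_ul=30.0,
                          mgcl2_stock_mm=100.0,
                          mncl2_stock_mm=100.0):
    """Pipetting volumes for 10 mM MgCl2 and 5 mM MnCl2 (values from the text).

    The reaction volume and stock concentrations are hypothetical defaults.
    """
    mgcl2_ul = stock_volume_ul(10.0, mgcl2_stock_mm, reaction_volume_ul)
    mncl2_ul = stock_volume_ul(5.0, mncl2_stock_mm, reaction_volume_ul)
    return {
        "MgCl2_ul": mgcl2_ul,
        "MnCl2_ul": mncl2_ul,
        # Volume left for substrate, kinase, labelled ATP and the other components
        "remaining_ul": reaction_volume_ul - mgcl2_ul - mncl2_ul,
    }


volumes = kinase_buffer_volumes()
```

With the assumed defaults this yields 3 µL of the MgCl2 stock and 1.5 µL of the MnCl2 stock per 30 µL reaction; other stock concentrations or reaction volumes can be passed as arguments.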

Determining the ability of the CCP protein to bind to or interact with a CCP target
molecule can be accomplished by determining direct binding. Determining the ability of
the CCP protein to bind to or interact with a CCP target molecule can be accomplished, for
example, by coupling the CCP protein with a radioisotope or enzymatic label such that
binding of the CCP protein to a CCP target molecule can be determined by detecting the
labeled CCP protein in a complex. For example, CCP molecules, e.g., CCP proteins, can
be labeled with 125I, 35S, 14C, or 3H, either directly or indirectly, and the radioisotope
detected by direct counting of radioemission or by scintillation counting. Alternatively,
CCP molecules can be enzymatically labeled with, for example, horseradish peroxidase,
alkaline phosphatase, or luciferase, and the enzymatic label detected by determination of
conversion of an appropriate substrate to product.

It is also within the scope of this invention to determine the ability of a compound
to modulate the interaction between CCP and its target molecule, without the labeling of
any of the interactants. For example, a microphysiometer can be used to detect the
interaction of CCP with its target molecule without the labeling of either CCP or the target
molecule. McConnell, H. M. et al. (1992) Science 257:1906-1912. As used herein, a
"microphysiometer" (e.g., Cytosensor) is an analytical instrument that measures the rate at
which a cell acidifies its environment using a light-addressable potentiometric sensor
(LAPS). Changes in this acidification rate can be used as an indicator of the interaction
between compound and receptor.

In a preferred embodiment, determining the ability of the CCP protein to bind to or
interact with a CCP target molecule can be accomplished by determining the activity of the
target molecule. For example, the activity of the target molecule can be determined by
detecting induction of a cellular second messenger of the target (e.g., intracellular Ca2+,
diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target on an appropriate
substrate, detecting the induction of a reporter gene (comprising a target-responsive
regulatory element operatively linked to a nucleic acid encoding a detectable marker, e.g.,
chloramphenicol acetyl transferase), or detecting a target-regulated cellular response.

In yet another embodiment, an assay of the present invention is a cell-free assay in
which a CCP protein or biologically active portion thereof is contacted with a test
compound and the ability of the test compound to bind to the CCP protein or biologically
active portion thereof is determined. Binding of the test compound to the CCP protein can
be determined either directly or indirectly as described above. In a preferred embodiment,
the assay includes contacting the CCP protein or biologically active portion thereof with a
known compound which binds CCP to form an assay mixture, contacting the assay mixture
with a test compound, and determining the ability of the test compound to interact with a
CCP protein, wherein determining the ability of the test compound to interact with a CCP
protein comprises determining the ability of the test compound to preferentially bind to
CCP or biologically active portion thereof as compared to the known compound.

In another embodiment, the assay is a cell-free assay in which a CCP protein or
biologically active portion thereof is contacted with a test compound and the ability of the
test compound to modulate (e.g., stimulate or inhibit) the activity of the CCP protein or
biologically active portion thereof is determined. Determining the ability of the test
compound to modulate the activity of a CCP protein can be accomplished, for example, by
determining the ability of the CCP protein to bind to a CCP target molecule by one of the
methods described above for determining direct binding. Determining the ability of the
CCP protein to bind to a CCP target molecule can also be accomplished using a
technology such as real-time Biomolecular Interaction Analysis (BIA). Sjolander, S. and
Urbaniczky, C. (1991) Anal. Chem. 63:2338-2345 and Szabo et al. (1995) Curr. Opin.
Struct. Biol. 5:699-705. As used herein, "BIA" is a technology for studying biospecific
interactions in real time, without labeling any of the interactants (e.g., BIAcore). Changes
in the optical phenomenon of surface plasmon resonance (SPR) can be used as an
indication of real-time reactions between biological molecules.

In an alternative embodiment, determining the ability of the test compound to
modulate the activity of a CCP protein can be accomplished by determining the ability of
the CCP protein to further modulate the activity of a CCP target molecule (e.g., a
CCP-mediated signal transduction pathway component). For example, the activity of the
effector molecule on an appropriate target can be determined, or the binding of the effector
to an appropriate target can be determined as previously described.

In yet another embodiment, the cell-free assay involves contacting a CCP protein or
biologically active portion thereof with a known compound which binds the CCP protein
to form an assay mixture, contacting the assay mixture with a test compound, and
determining the ability of the test compound to interact with the CCP protein, wherein
determining the ability of the test compound to interact with the CCP protein comprises
determining the ability of the CCP protein to preferentially bind to or modulate the activity
of a CCP target molecule.

The cell-free assays of the present invention are amenable to use of soluble
and/or membrane-bound forms of proteins (e.g., CCP proteins or biologically active
portions thereof). In the case of cell-free assays in which a membrane-bound form of a
protein is used it may be desirable to utilize a solubilizing agent such that the
membrane-bound form of the protein is maintained in solution. Examples of such solubilizing agents
include non-ionic detergents such as n-octylglucoside, n-dodecylglucoside,
n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton®
X-100, Triton® X-114, Thesit®, Isotridecypoly(ethylene glycol ether)n,
3-[(3-cholamidopropyl)dimethylammonio]-1-propane sulfonate (CHAPS),
3-[(3-cholamidopropyl)dimethylammonio]-2-hydroxy-1-propane sulfonate (CHAPSO), or
N-dodecyl-N,N-dimethyl-3-ammonio-1-propane sulfonate.

In more than one embodiment of the above assay methods of the present invention,
it may be desirable to immobilize either CCP or its target molecule to facilitate separation
of complexed from uncomplexed forms of one or both of the proteins, as well as to
accommodate automation of the assay. Binding of a test compound to a CCP protein, or
interaction of a CCP protein with a target molecule in the presence and absence of a
candidate compound, can be accomplished in any vessel suitable for containing the
reactants. Examples of such vessels include microtitre plates, test tubes, and
micro-centrifuge tubes. In one embodiment, a fusion protein can be provided which adds a
domain that allows one or both of the proteins to be bound to a matrix. For example,
glutathione-S-transferase/CCP fusion proteins or glutathione-S-transferase/target fusion
proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis,
MO) or glutathione derivatized microtitre plates, which are then combined with the test
compound or the test compound and either the non-adsorbed target protein or CCP protein,
and the mixture incubated under conditions conducive to complex formation (e.g., at
physiological conditions for salt and pH). Following incubation, the beads or microtitre
plate wells are washed to remove any unbound components, the matrix is immobilized in the
case of beads, and complex formation is determined either directly or indirectly, for example,
as described above. Alternatively, the complexes can be dissociated from the matrix, and the
level of CCP binding or activity determined using standard techniques.

Other techniques for immobilizing proteins on matrices can also be used in the
screening assays of the invention. For example, either a CCP protein or a CCP target
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated
CCP protein or target molecules can be prepared from biotin-NHS
(N-hydroxy-succinimide) using techniques well known in the art (e.g., biotinylation kit, Pierce
Chemicals, Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well
plates (Pierce Chemical). Alternatively, antibodies reactive with CCP protein or target
molecules but which do not interfere with binding of the CCP protein to its target molecule
can be derivatized to the wells of the plate, and unbound target or CCP protein trapped in
the wells by antibody conjugation. Methods for detecting such complexes, in addition to
those described above for the GST-immobilized complexes, include immunodetection of
complexes using antibodies reactive with the CCP protein or target molecule, as well as
enzyme-linked assays which rely on detecting an enzymatic activity associated with the
CCP protein or target molecule.

In another embodiment, modulators of CCP expression are identified in a method 
wherein a cell is contacted with a candidate compound and the expression of CCP mRNA 
or protein in the cell is determined. The level of expression of CCP mRNA or protein in 
the presence of the candidate compound is compared to the level of expression of CCP 
mRNA or protein in the absence of the candidate compound. The candidate compound can 
then be identified as a modulator of CCP expression based on this comparison. For 
example, when expression of CCP mRNA or protein is greater (statistically significantly 
greater) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as a stimulator of CCP mRNA or protein expression. 
Alternatively, when expression of CCP mRNA or protein is less (statistically significantly 
less) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of CCP mRNA or protein expression. The level of 
CCP mRNA or protein expression in the cells can be determined by methods described 
herein for detecting CCP mRNA or protein. 
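
The comparison described above can be sketched in code. The specification does not name a statistical test, so the example below substitutes an exact two-sided permutation test on the difference in mean expression; the function names, the 0.05 significance threshold, and the replicate values are hypothetical and serve only to illustrate how a candidate compound would be labelled a stimulator or an inhibitor.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, control):
    """Exact two-sided permutation p-value for the difference in group means."""
    pooled = list(treated) + list(control)
    indices = range(len(pooled))
    observed = abs(mean(treated) - mean(control))
    total = 0
    extreme = 0
    # Enumerate every way of relabelling the pooled measurements as "treated"
    for chosen in combinations(indices, len(treated)):
        chosen_set = set(chosen)
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        if abs(mean(group_a) - mean(group_b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def classify_candidate(treated, control, alpha=0.05):
    """Label a compound from CCP expression measured with and without it."""
    if permutation_p_value(treated, control) >= alpha:
        return "no significant effect"
    return "stimulator" if mean(treated) > mean(control) else "inhibitor"


# Hypothetical relative CCP mRNA levels, four replicates per condition
with_compound = [2.1, 2.4, 2.2, 2.6]
without_compound = [1.0, 1.1, 0.9, 1.2]
label = classify_candidate(with_compound, without_compound)
```

An exact permutation test was chosen here because it needs no distributional assumptions and only the standard library; with four replicates per group the smallest attainable two-sided p-value is 2/70, so fewer replicates could never reach the assumed 0.05 threshold.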

In yet another aspect of the invention, the CCP proteins can be used as "bait
proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,317;
Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem.
268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene
8:1693-1696; and Brent WO94/10300), to identify other proteins, which bind to or interact
with CCP ("CCP-binding proteins" or "CCP-bp") and are involved in CCP activity. Such
CCP-binding proteins are also likely to be involved in the propagation of signals by the
CCP proteins or CCP targets as, for example, downstream elements of a CCP-mediated
signaling pathway. Alternatively, such CCP-binding proteins are likely to be CCP
inhibitors. Alternatively, a mammalian two-hybrid system can be used which includes, e.g.,
a chimeric green fluorescent protein encoding reporter gene (Shioda et al. 2000, Proc. Natl.
Acad. Sci. USA 97, 5220-5224). Yet another alternative consists of a bacterial two-hybrid
system using, e.g., HIS as reporter gene (Joung et al. 2000, Proc. Natl. Acad. Sci. USA 97,
7382-7387).

The two-hybrid system is based on the modular nature of most transcription factors,
which consist of separable DNA-binding and activation domains. Briefly, the assay
utilizes two different DNA constructs. In one construct, the gene that codes for a CCP
protein is fused to a gene encoding the DNA binding domain of a known transcription
factor (e.g., GAL-4). In the other construct, a DNA sequence, from a library of DNA
sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a gene that
codes for the activation domain of the known transcription factor. If the "bait" and the
"prey" proteins are able to interact, in vivo, forming a CCP-dependent complex, the
DNA-binding and activation domains of the transcription factor are brought into close proximity.
This proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked
to a transcriptional regulatory site responsive to the transcription factor. Expression of the
reporter gene can be detected and cell colonies containing the functional transcription
factor can be isolated and used to obtain the cloned gene which encodes the protein which
interacts with the CCP protein.

This invention further pertains to novel agents identified by the above-described
screening assays. Accordingly, it is within the scope of this invention to further use an
agent identified as described herein in an appropriate plant or animal model. For example,
an agent identified as described herein (e.g., a CCP modulating agent, an antisense CCP
nucleic acid molecule, a CCP-specific antibody, or a CCP-binding partner) can be used in
a plant or animal model to determine the efficacy, toxicity, or side effects of treatment with
such an agent. Alternatively, an agent identified as described herein can be used in a plant
or animal model to determine the mechanism of action of such an agent. Furthermore, this
invention pertains to uses of novel agents identified by the above-described screening
assays for the agricultural and therapeutic uses described herein.

C. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the
corresponding complete gene sequences) can be used in numerous ways as polynucleotide
reagents. For example, these sequences can be used to: map their respective genes on a
chromosome and, thus, locate gene regions associated with genetic disease; identify an
individual from a minute biological sample (tissue typing); and aid in forensic
identification of a biological sample. Once the sequence (or a portion of the sequence) of a
gene has been isolated, this sequence can be used to map the location of the gene on a
chromosome. This process is called chromosome mapping. Accordingly, portions or
fragments of the CCP nucleotide sequences, described herein, can be used to map the
location of the CCP genes on a chromosome. The mapping of the CCP sequences to
chromosomes is an important first step in correlating these sequences with genes
associated with disease.

Briefly, CCP genes can be mapped to chromosomes by preparing PCR primers
(preferably 15-25 bp in length) from the CCP nucleotide sequences. Computer analysis of
the CCP sequences can be used to predict primers that do not span more than one exon in
the genomic DNA, since primers spanning an exon boundary would complicate the
amplification process. These primers can then be used for PCR screening of cell hybrids
containing individual plant or human chromosomes. Only those hybrids containing the
plant or human gene corresponding to the CCP sequences will yield an amplified fragment.
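
The hybrid-panel logic described above can be sketched as follows. This is a minimal illustration only: the panel contents, hybrid names and chromosome numbers are hypothetical, and the function simply reports the chromosomes whose presence across the hybrids coincides exactly with the pattern of PCR-positive hybrids.

```python
def assign_chromosome(panel, pcr_positive):
    """Return chromosomes whose carriers match the PCR-positive hybrids exactly.

    panel:        dict mapping hybrid name -> set of donor chromosomes retained
    pcr_positive: set of hybrid names that yielded an amplified fragment
    """
    all_chromosomes = set().union(*panel.values())
    matches = []
    for chrom in sorted(all_chromosomes):
        # Hybrids that retain this chromosome
        carriers = {name for name, chroms in panel.items() if chrom in chroms}
        if carriers == pcr_positive:
            matches.append(chrom)
    return matches


# Hypothetical panel: three hybrids, each retaining two donor chromosomes.
panel = {
    "hybrid_A": {1, 3},
    "hybrid_B": {2, 3},
    "hybrid_C": {1, 2},
}
result = assign_chromosome(panel, pcr_positive={"hybrid_A", "hybrid_B"})
```

In this made-up panel only chromosome 3 is present in exactly the two PCR-positive hybrids, so it is the single candidate. A real panel would need enough hybrids that every chromosome has a distinct presence pattern.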

Other mapping strategies which can similarly be used to map a CCP sequence to its
chromosome include in situ hybridization (described in Fan, Y. et al. (1990) Proc. Natl.
Acad. Sci. USA, 87:6223-27), pre-screening with labeled flow-sorted chromosomes, and
pre-selection by hybridization to chromosome-specific cDNA libraries.

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase
chromosomal spread can further be used to provide a precise chromosomal location in one
step. Chromosome spreads can be made using cells whose division has been blocked in
metaphase by a chemical such as colcemid that disrupts the mitotic spindle. The
chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A pattern
of light and dark bands develops on each chromosome, so that the chromosomes can be
identified individually. The FISH technique can be used with a DNA sequence as short as
500 or 600 bases. However, clones larger than 1,000 bases have a higher likelihood of
binding to a unique chromosomal location with sufficient signal intensity for simple
detection. Preferably 1,000 bases, and more preferably 2,000 bases, will suffice to get good
results in a reasonable amount of time. For a review of this technique, see Verma et al.,
Human Chromosomes: A Manual of Basic Techniques (Pergamon Press, New York 1988).

Reagents for chromosome mapping can be used individually to mark a single
chromosome or a single site on that chromosome, or panels of reagents can be used for
marking multiple sites and/or multiple chromosomes. Reagents corresponding to
noncoding regions of the genes are actually preferred for mapping purposes. Coding
sequences are more likely to be conserved within gene families, thus increasing the chance
of cross-hybridization during chromosomal mapping.

Once a sequence has been mapped to a precise chromosomal location, the physical
position of the sequence on the chromosome can be correlated with genetic map data.
(Such data are found, for example, in V. McKusick, Mendelian Inheritance in Man,
available on-line through Johns Hopkins University Welch Medical Library). The
relationship between a gene and a disease, mapped to the same chromosomal region, can
then be identified through linkage analysis (co-inheritance of physically adjacent genes),
described in, for example, Egeland, J. et al. (1987) Nature, 325:783-787.

Moreover, differences in the DNA sequences between plants affected and
unaffected with a disease associated with the CCP gene can be determined. If a mutation
is observed in some or all of the affected plants but not in any unaffected plants, then the
mutation is likely to be the causative agent of the particular disease. Comparison of
affected and unaffected plants generally involves first looking for structural alterations in
the chromosomes, such as deletions or translocations that are visible from chromosome
spreads or detectable using PCR based on that DNA sequence. Ultimately, complete
sequencing of genes from several plants can be performed to confirm the presence of a
mutation and to distinguish mutations from polymorphisms.
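
The comparison rule stated above (a mutation seen in some or all affected plants, but in no unaffected plant) can be expressed as a simple set operation. The sketch below is illustrative only; the plant identifiers and variant labels are hypothetical, and a real analysis would additionally need to separate causative mutations from polymorphisms, as noted in the text.

```python
def candidate_mutations(affected, unaffected):
    """Variants found in at least one affected plant and in no unaffected plant.

    affected, unaffected: dicts mapping plant id -> set of observed variants
    """
    seen_in_affected = set().union(*affected.values()) if affected else set()
    seen_in_unaffected = set().union(*unaffected.values()) if unaffected else set()
    return sorted(seen_in_affected - seen_in_unaffected)


# Hypothetical variant calls for a CCP gene in four plants.
affected = {
    "plant_1": {"c.120A>G", "c.455del"},
    "plant_2": {"c.455del"},
}
unaffected = {
    "plant_3": {"c.120A>G"},
    "plant_4": set(),
}
candidates = candidate_mutations(affected, unaffected)
```

Here the variant shared with an unaffected plant is discarded as a likely polymorphism, leaving one candidate for confirmation by complete sequencing.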

D. Predictive Medicine:

The present invention also pertains to the field of predictive medicine in which
diagnostic assays, prognostic assays, and monitoring of clinical trials are used for prognostic
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one
aspect of the present invention relates to diagnostic assays for determining CCP protein
and/or nucleic acid expression as well as CCP activity, in the context of a biological
sample (e.g., blood, serum, cells, tissue) to thereby determine whether an individual is
afflicted with a disease or disorder, or is at risk of developing a disorder, associated with
aberrant CCP expression or activity. The invention also provides for prognostic (or
predictive) assays for determining whether an individual is at risk of developing a disorder
associated with CCP protein, nucleic acid expression or activity. For example, mutations
in a CCP gene can be assayed in a biological sample. Such assays can be used for
prognostic or predictive purposes to thereby prophylactically treat an individual prior to the
onset of a disorder characterized by or associated with CCP protein, nucleic acid
expression or activity.

Another aspect of the invention pertains to monitoring the influence of agents (e.g.,
drugs, compounds) on the expression or activity of CCP in clinical trials.

E. Methods of Treatment:

The present invention provides for both prophylactic and therapeutic methods of
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated
with aberrant CCP expression or activity. With regard to both prophylactic and
therapeutic methods of treatment, such treatments may be specifically tailored or modified,
based on knowledge obtained from the field of pharmacogenomics. "Pharmacogenomics",
as used herein, refers to the application of genomics technologies such as gene sequencing,
statistical genetics, and gene expression analysis to drugs in clinical development and on
the market. More specifically, the term refers to the study of how a patient's genes determine
his or her response to a drug (e.g., a patient's "drug response phenotype", or "drug response
genotype"). Thus, another aspect of the invention provides methods for tailoring an
individual's prophylactic or therapeutic treatment with either the CCP molecules of the
present invention or CCP modulators according to that individual's drug response
genotype. Pharmacogenomics allows a clinician or physician to target prophylactic or
therapeutic treatments to patients who will most benefit from the treatment and to avoid
treatment of patients who will experience toxic drug-related side effects.

This invention is further illustrated by the following examples which should not be 
construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application, as well as the Figures and the Sequence 
Listing are incorporated herein by reference.

EXAMPLES

EXAMPLE 1: IDENTIFICATION OF PLANT CCP POLYPEPTIDES USING 

THE TWO HYBRID SYSTEM WITH CDC2B AS A BAIT 



A two-hybrid screening was performed using as bait a fusion between the GAL4
DNA-binding domain and one of the following: CDC2bAt.N161 (GenBank accession
number D10851; residue Asp161 converted into Asn161); CKS1At (GenBank accession
number AJ000016); E2Fa (=E2F5) (GenBank accession number AJ294534) dimerization
domain (226-356 aa; SEQ ID NO:205); CKI4 (SEQ ID NO:264); PLP1 (GenBank
accession number T01601); KLPNT1 (GenBank accession number AB011479; protein ID
number BAB11568) motor domain (36-508 aa); KLPNT1 (GenBank accession number
AB011479; protein ID number BAB11568) stalk domain (427-867 aa); KLPNT2=TH65
(GenBank accession number AJ001729) neck domain (3-186 aa); KLPNT2=TH65
(GenBank accession number AJ001729) stalk domain (73-608 aa); E2Fb (=E2F3)
(GenBank accession number AJ294533) N-terminal domain (1-385 aa; SEQ ID NO:206),
respectively.

CDC2bAt.N161 is a dominant negative form of the CDC2bAt protein. The D161
residue in CDC2bAt is crucial for ATP binding and, thus, the mutation of this residue
results in an inactive kinase. The interactions between this mutated CDK and its substrates
and regulatory proteins are also more stabilised as a result of this mutation.

In yeast the PHO genes are part of a complex regulatory network linking phosphate
availability with the expression of phosphatases. When phosphate levels are high the
PHO80/PHO85 cyclin/CDK complex phosphorylates a transcription factor. This
transcription factor of phosphatase genes thereby becomes inactive. The S. cerevisiae
PHO85 protein can interact with the G1-specific cyclins PCL1 and PCL2 (close
homologues of PHO80). In a yeast strain deficient for the G1 cyclins CLN1 and CLN2,
PHO80 is required for G1 progression. This result suggests that PHO85 is involved in a
regulatory pathway that links the nutrient status of the cell with cell division activity. The
five PLPs of A. thaliana show similarity to the yeast cyclin-like PHO80 gene.

Kinesins use the cytoskeleton to move vesicles, organelles, chromosomes
and the like around in the cell. They can also be involved in spindle formation. Kinesins
consist of three functionally unrelated domains: the motor domain (involved in microtubule
binding; contains the ATPase domain), the stalk region (involved in homo- or
heterodimerisation of the kinesins), and the tail (involved in the interaction with the
'substrates' of the kinesin). Two-hybrid screens were performed using different parts of two
kinesin-related proteins (KLPNT1 and KLPNT2, the latter being more than 80% identical
to KLPNT1). Other information obtained by the two-hybrid approach is the dimerization of
the kinesins: KLPNT1 and KLPNT2 interact (stalks and stalks-tail) with and between
themselves.

Vectors and strains used were provided with the Matchmaker Two-Hybrid System
(Clontech, Palo Alto, CA). The bait was constructed by inserting the CDC2bAt.N161
(GenBank accession number D10851; residue Asp161 converted into Asn161); CKS1At
(GenBank accession number AJ000016); E2Fa (=E2F5) (GenBank accession number
AJ294534) dimerization domain (226-356 aa; SEQ ID NO:205); CKI4 (SEQ ID NO:264);
PLP1 (GenBank accession number T01601); KLPNT1 (GenBank accession number
AB011479; protein ID number BAB11568) motor domain (36-508 aa); KLPNT1
(GenBank accession number AB011479; protein ID number BAB11568) stalk domain
(427-867 aa); KLPNT2=TH65 (GenBank accession number AJ001729) neck domain
(3-186 aa); KLPNT2=TH65 (GenBank accession number AJ001729) stalk domain (73-608
aa); E2Fb (=E2F3) (GenBank accession number AJ294533) N-terminal domain (1-385 aa;
SEQ ID NO:206), respectively, into the pGBT9 vector. Bait vectors were constructed by
introducing the PCR fragment created from the corresponding cDNA using primers to
incorporate EcoRI and BamHI restriction enzyme sites. The PCR fragment was cut with
EcoRI and BamHI and cloned into the EcoRI and BamHI sites of pGBT9, resulting in the
desired plasmid. The GAL4 activation domain cDNA fusion library was constructed as
described in De Veylder et al. 1999, 208(4) p453-62 from mRNA of Arabidopsis thaliana
cell suspensions harvested at various growing stages: early exponential, exponential, early
stationary, and stationary phase.

For the screening a 1-liter culture of the Saccharomyces cerevisiae strain HF7c
(MATa ura3-52 his3-200 ade2-101 lys2-801 trp1-901 leu2-3,112 gal4-542 gal80-538
LYS2::GAL1UAS-GAL1TATA-HIS3 URA3::GAL4 17mers(3x)-CYC1TATA-LacZ) was
sequentially transformed with the bait plasmid and 20 µg DNA of the library using the
lithium acetate method (Gietz et al. (1992) supra). To estimate the number of independent
cotransformants, 1/1000 of the transformation mix was plated on Leu- and Trp- medium.
The rest of the transformation mix was plated on medium to select for histidine
prototrophy (Trp-, Leu-, His-). After 5 days of growth at 30°C, the colonies larger than 2
mm were streaked on histidine-lacking medium. In total, for each screening at least 10^7
independent cotransformants were screened for their ability to grow on histidine-free
medium. The activation domain plasmids of the His+ colonies were isolated as described
(Hoffman and Winston, 1987, Gene 57, 267-272). The hybriZAP™ inserts were PCR
amplified and the PCR fragments were digested with AluI and fractionated on a 2%
agarose gel. Plasmid DNAs whose inserts gave rise to different restriction patterns
were electroporated into Escherichia coli XL1-Blue, and the DNA sequence of the inserts
was determined. Extracted DNA was also used to retransform HF7c to test the specificity
of the interaction.

Using the foregoing technique, 61 cDNAs were identified; their sequences were
determined and found to contain open reading frames termed CCP1 through CCP61
(Figures 1-61).
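
As a worked illustration of the cotransformant estimate used in the screening above (1/1000 of the transformation mix plated on Leu- and Trp- medium, with at least 10^7 independent cotransformants screened), the arithmetic can be sketched as below. The colony count in the usage line is a hypothetical value, not a result reported here.

```python
def estimate_cotransformants(colonies_on_control_plate, dilution_factor=1000):
    """Scale the colony count from the plated 1/1000 fraction to the whole mix."""
    return colonies_on_control_plate * dilution_factor


def screen_is_deep_enough(colonies_on_control_plate, minimum=10**7):
    """True if the estimated number of cotransformants reaches the stated minimum."""
    return estimate_cotransformants(colonies_on_control_plate) >= minimum


# Hypothetical count: 12,000 colonies on the Leu-/Trp- control plate.
total = estimate_cotransformants(12_000)
deep_enough = screen_is_deep_enough(12_000)
```

With these assumed numbers the estimate is 1.2 x 10^7 cotransformants, which meets the stated threshold; a count below 10,000 colonies on the control plate would not.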

EXAMPLE 2: EXTENSION OF CCP ENCODING POLYNUCLEOTIDES 

TO FULL LENGTH OR TO RECOVER REGULATORY 

ELEMENTS 

The CCP encoding nucleic acid sequences (SEQ ID NO: 1-66 or 228-239) are used
to design oligonucleotide primers for extending a partial nucleotide sequence to full length
or for obtaining 5' sequences from genomic or cDNA libraries. One primer is synthesized
to initiate extension in the antisense direction (XLR) and the other is synthesized to extend
sequence in the sense direction (XLF). Primers allow the extension of the known CCP
encoding sequence "outward", generating amplicons containing new, unknown nucleotide
sequence for the region of interest. The initial primers are designed from the cDNA using
OLIGO® 4.06 Primer Analysis Software (National Biosciences), or another appropriate
program, to be preferably 22-30 nucleotides in length, to have a GC content of preferably
50% or more, and to anneal to the target sequence at temperatures preferably about
68°-72°C. Any stretch of nucleotides which would result in hairpin structures and
primer-primer dimerizations is avoided. The original, selected cDNA libraries, prepared from
mRNA isolated from actively dividing cells, or a plant genomic library are used to extend
the sequence; the latter is most useful to obtain 5' upstream regions. If more extension is
necessary or desired, additional sets of primers are designed to further extend the known
region.
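
The length and GC-content preferences stated above can be checked with a few lines of code. The following is a minimal sketch and not a substitute for the primer analysis software named in the text: it tests only the 22-30 nucleotide length window and the 50% GC threshold, and does not attempt to estimate annealing temperature or to detect hairpins and primer dimers. The example sequences are artificial.

```python
def gc_fraction(primer):
    """Fraction of G and C bases in the primer (case-insensitive)."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def meets_design_criteria(primer, min_len=22, max_len=30, min_gc=0.5):
    """Check the preferred length window and minimum GC content."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# Artificial sequences used only to exercise the checks.
balanced = "GC" * 6 + "AT" * 6   # 24 nt, 50% GC
at_rich = "AT" * 12              # 24 nt, 0% GC
too_short = "GC" * 5             # 10 nt

checks = [meets_design_criteria(p) for p in (balanced, at_rich, too_short)]
```

Only the first sequence satisfies both criteria; the second fails on GC content and the third on length.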

Sense XLF primers can also be designed based on publicly available genomic
sequences. GENEMARK.hmm (hidden Markov model) version 2.2a software (default
parameters) can, e.g., be used to predict open reading frames. The 5' end of the predicted
open reading frame is then used to design the sense XLF primer. Said XLF
primer and the appropriate XLR primer are then used in an RT-PCR (reverse
transcription-polymerase chain reaction) to amplify the predicted cDNA. The resulting PCR
product is cloned in a suitable vector and subjected to DNA sequence analysis to verify the
prediction.

Primers used to amplify coding regions of the CCPs of the invention are designed
such that the PCR product can be cloned in the pDONR201 vector (Gateway™ cloning
system, Invitrogen). Thus, a sense primer has the attB1 site (SEQ ID NO:246) at its 5' end.
For current purposes, the attB1 site is followed by a consensus Kozak sequence (SEQ ID
NO:247; Kozak (1989) J Cell Biol 108:229-241; Lütcke et al. (1987) EMBO J 5:43-48).
The 3' end of the sense primer comprises the gene-specific parts as indicated in Figures
1-46. An antisense primer has at the 5' end the attB2 site (SEQ ID NO:248) followed by the
inverse complement of the gene/coding region of interest as indicated in Figures 1-46.
Primers used for CCP amplification by PCR are given with their SEQ ID NOs in Table 3.
The sequence of cloned CCP PCR products is determined using the sense primer
prm1024 (SEQ ID NO:265) and the antisense primer prm1025 (SEQ ID NO:266).
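
The primer layout described above (attB1 site, then Kozak sequence, then gene-specific part for the sense primer; attB2 site, then inverse complement of the coding region for the antisense primer) can be sketched as follows. All sequences in the example are short placeholders and are not SEQ ID NO:246-248; the length of the gene-specific part is likewise an assumed parameter, since the actual gene-specific parts are those indicated in the Figures.

```python
# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Inverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def build_primers(cds, attb1, kozak, attb2, gene_specific_len=20):
    """Assemble sense and antisense primers, both written 5' to 3'.

    gene_specific_len is an illustrative assumption, not a value from the text.
    """
    sense = attb1 + kozak + cds[:gene_specific_len]
    antisense = attb2 + reverse_complement(cds[-gene_specific_len:])
    return sense, antisense


# Placeholder adapter sequences and a toy coding sequence (hypothetical).
sense, antisense = build_primers(
    cds="ATGGCTTCTTAA",
    attb1="AAAA",
    kozak="CC",
    attb2="TTTT",
    gene_specific_len=6,
)
```

For the toy input the sense primer is the two adapters followed by the first six bases of the coding sequence, and the antisense primer is the attB2 placeholder followed by the inverse complement of the last six bases.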

TABLE III:

CCP            PCR primers            sense primer   antisense primer
               (sense + antisense)    SEQ ID NO:     SEQ ID NO:

CCP1           prm0733 + prm0734      133            134
CCP2           prm0663 + prm0664      135            136
CCP3           prm0705 + prm0706      137            138
CCP4           prm0659 + prm0660      139            140
CCP5           prm0749 + prm0750      141            142
CCP6           prm0707 + prm0708      143            144
CCP7/8         prm0657 + prm0658      145            146
CCP9           prm0582 + prm0583      147            148
CCP10          prm0671 + prm0672      149            150
CCP11          prm0729 + prm0730      151            152
CCP12+CCP13    prm1676 + prm1677      153            154
CCP14          prm0701 + prm0702      155            156
CCP15          prm0445 + prm0446      157            158
CCP16          prm0321 + prm0322      159            160
CCP17          prm0632 + prm0633      161            162
CCP18          prm0488 + prm0489      163            164
CCP19          prm0661 + prm0662      165            166
CCP20+CCP21    prm0709 + prm0710      167            168
CCP22          prm0711 + prm0712      169            170
CCP23          prm0819 + prm0820      171            172
CCP24          prm0739 + prm0740      173            174
CCP25          prm0741 + prm0742      175            176
CCP26          prm0703 + prm0704      177            178
CCP27          prm0817 + prm0818      179            180
CCP28          prm0713 + prm0714      181            182
CCP29          /                      /              /
CCP30          prm0480 + prm0481      183            184
CCP31          prm0737 + prm0738      185            186
CCP32          prm1493 + prm1494      187            188
CCP33          prm0319 + prm0320      189            190
CCP34          prm1377 + prm1378      191            192
CCP35          prm1381 + prm1382      193            194
CCP36          /                      /              /
CCP37          prm1379 + prm1380      195            196
CCP38          prm1383 + prm1384      197            198


By following the instructions for the XL-PCR kit (Perkin Elmer) and thoroughly
mixing the enzyme and reaction mix, high fidelity amplification is obtained. Beginning
with 40 pmol of each primer and the recommended concentrations of all other components
of the kit, PCR is performed using the Peltier Thermal Cycler (PTC200; MJ Research,
Watertown MA) and the following parameters:





Step 1     94°C for 1 min (initial denaturation)
Step 2     65°C for 1 min
Step 3     68°C for 6 min
Step 4     94°C for 15 sec
Step 5     65°C for 1 min
Step 6     68°C for 7 min
Step 7     Repeat steps 4-6 for 15 additional cycles
Step 8     94°C for 15 sec
Step 9     65°C for 1 min
Step 10    68°C for 7:15 min
Step 11    Repeat steps 8-10 for 12 cycles
Step 12    72°C for 8 min
Step 13    4°C (and holding)
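
For planning purposes, the programmed hold time of the cycling parameters above can be totalled as in the sketch below. Two readings are assumptions made for this illustration: step 7 is taken to mean steps 4-6 run once plus 15 repeats (16 cycles), and step 11 is taken to mean 12 cycles of steps 8-10 in total. Ramp times between temperatures and the final 4°C hold are not included.

```python
# Each block: (number of cycles, [(temperature in °C, hold time in seconds), ...])
PROGRAM = [
    (1,  [(94, 60), (65, 60), (68, 360)]),   # steps 1-3
    (16, [(94, 15), (65, 60), (68, 420)]),   # steps 4-6, once plus 15 repeats (assumed)
    (12, [(94, 15), (65, 60), (68, 435)]),   # steps 8-10, 12 cycles in total (assumed)
    (1,  [(72, 480)]),                       # step 12, final extension
]


def total_seconds(program):
    """Sum of programmed hold times over all blocks and cycles."""
    return sum(
        cycles * sum(seconds for _temp, seconds in steps)
        for cycles, steps in program
    )


def total_cycles(program):
    """Number of amplification cycles, counting only blocks with a 94°C step."""
    return sum(
        cycles for cycles, steps in program
        if any(temp == 94 for temp, _seconds in steps)
    )


run_seconds = total_seconds(PROGRAM)
run_cycles = total_cycles(PROGRAM)
```

Under these assumptions the programmed hold time is 15,000 seconds (4 hours 10 minutes) over 29 amplification cycles; the actual run is longer because of ramping.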



A 5-10 µl aliquot of the reaction mixture is analyzed by electrophoresis on a low
concentration (about 0.6-0.8%) agarose mini-gel to determine which reactions were
successful in extending the sequence. Bands thought to contain the largest products are
selected and cut out of the gel. Further purification involves using a commercial gel
extraction method such as QIAQuick™ (QIAGEN Inc). After recovery of the DNA,
Klenow enzyme is used to trim single-stranded nucleotide overhangs, creating blunt
ends which facilitate religation and cloning. After ethanol precipitation, the products are
redissolved in 13 µl of ligation buffer, 1 µl T4 DNA ligase (15 units) and 1 µl T4
polynucleotide kinase are added, and the mixture is incubated at room temperature for 2-3
hours or overnight at 16°C. Competent E. coli cells (in 40 µl of appropriate media) are
transformed with 3 µl of ligation mixture and cultured in 80 µl of SOC medium
(Sambrook, supra). After incubation for one hour at 37°C, the whole transformation
mixture is plated on Luria Bertani (LB)-agar (Sambrook, supra) containing 2xCarb. The
following day, several colonies are randomly picked from each plate and cultured in 150 µl
of liquid LB/2xCarb medium placed in an individual well of an appropriate, commercially
available, sterile 96-well microtiter plate. The following day, 5 µl of each overnight culture
is transferred into a non-sterile 96-well plate and, after dilution 1:10 with water, 5 µl of
each sample is transferred into a PCR array. For PCR amplification, 18 µl of concentrated
PCR reaction mix (3.3x) containing 4 units of rTth DNA polymerase, a vector primer and
both of the gene-specific primers used for the extension reaction are added to each well.
Amplification is performed using the following conditions:



Step 1     94°C for 60 sec
Step 2     94°C for 20 sec
Step 3     55°C for 30 sec
Step 4     72°C for 90 sec
Step 5     Repeat steps 2-4 for an additional 29 cycles
Step 6     72°C for 180 sec
Step 7     4°C (and holding)



Aliquots of the PCR reactions are run on agarose gels together with molecular weight
markers. The sizes of the PCR products are compared to the original partial cDNAs, and
appropriate clones are selected, ligated into plasmid and sequenced.



EXAMPLE 3: EXPRESSION OF RECOMBINANT CCP PROTEINS IN 

TRANSGENIC PLANTS 

In this example, the CCP molecules of the present invention were expressed in a
35S expression vector in transgenic plants. The CCP molecules of this invention were
cloned using standard cloning procedures between a suitable promoter, e.g., the CaMV35S
promoter or any promoter from, e.g., Table II, and a suitable terminator, e.g., the NOS 3'
untranslated region. The resulting recombinant gene is subsequently cloned in a suitable
binary vector and the resulting plant transformation vector is then transferred to
Agrobacterium tumefaciens. Arabidopsis thaliana is transformed with this Agrobacterium
applying the in planta flower-dip transformation method (Clough and Bent, Plant J.
16:735-743, 1998). Transgenic plant lines are selected on a growth medium containing the
suitable selection agent (e.g., kanamycin or Basta) or on the basis of scoring the expression
of a screenable marker (e.g., luciferase, green fluorescent protein).

For tissue-specific expression, the CCP gene can also be expressed under control of
the minimal 35S promoter containing UAS elements. These UAS elements are sites for
transcriptional activation by the GAL4-VP16 fusion protein. The GAL4-VP16 fusion
protein in turn is expressed under control of a tissue-specific promoter. The UAS-CCP
construct and the GAL4-VP16 construct are combined by co-transformation of both
constructs, subsequent transformation of single constructs or by sexual cross of lines that
contain the single constructs. The advantage of this two-component system is that a wide
array of tissue-specific expression patterns can be generated for a specific transgene, by
simply crossing selected parent lines expressing the UAS-CCP construct with various
tissue-specific GAL4-VP16 lines. A tissue-specific promoter/CCP combination that gives
a desired phenotype can subsequently be recloned in a single expression vector, to avoid
stacking of transgene constructs in commercial lines.

Primary transformants are characterized by Northern and Western blotting using
1-4 week old plantlets. Expression levels were compared with those of non-transformed
(control) plants.

EXAMPLE 4: DOWNREGULATION OF TARGET CCP GENES IN

TRANSGENIC PLANTS

Plant genes can be specifically downregulated by antisense and co-suppression
technologies. These technologies are based on the synthesis of antisense transcripts,
complementary to the mRNA of a given CCP gene. Several methods described in the
literature increase the efficiency of this downregulation, for example expressing the
sense strand with introduced inverted repeats, rather than the antisense strand. The
constructs for downregulation of target genes are made similarly to those for expression of
recombinant proteins, i.e., they are fused to promoter sequences and transcription
termination sequences (see Example 3). Promoters used for this purpose are constitutive
promoters as well as tissue-specific promoters.

EXAMPLE 5: AGROBACTERIUM-MEDIATED RICE TRANSFORMATION 

Mature dry seeds of the rice japonica cultivars Nipponbare or Taipei 309 are
dehusked, sterilised and germinated on a medium containing 2,4-D
(2,4-dichlorophenoxyacetic acid). After incubation in the dark for four weeks, embryogenic,
scutellum-derived calli are excised and propagated on the same medium. Selected
embryogenic calli are then co-cultivated with Agrobacterium. Widely used
Agrobacterium strains such as LBA4404 or C58 harbouring binary T-DNA vectors can be
used. The hpt gene in combination with hygromycin is suitable as a selectable marker
system but other systems can be used. Co-cultivated callus is grown on 2,4-D-containing
medium for 4 to 5 weeks in the dark in the presence of a suitable concentration of the
selective agent. During this period, rapidly growing resistant callus islands develop. After
transfer of this material to a medium with a reduced concentration of 2,4-D and incubation
in the light, the embryogenic potential is released and shoots develop in the next four to
five weeks. Shoots are excised from the callus and incubated for one week on an
auxin-containing medium from which they can be transferred to the soil. Hardened shoots are
grown under high humidity and short days in a phytotron. Seeds can be harvested three to
five months after transplanting. The method yields single locus transformants at a rate of
over 50% (Aldemita and Hodges (1996) Planta 199:612-617; Chan et al. (1993) Plant
Mol. Biol. 22:491-506; Hiei et al. (1994) Plant J. 6:271-282).

EXAMPLE 6: EXPRESSION OF RECOMBINANT CCP PROTEINS IN

BACTERIAL CELLS

In this example, the CCP molecules of the present invention are expressed as a
recombinant glutathione-S-transferase (GST) fusion polypeptide in E. coli and the fusion
polypeptide is isolated and characterized. Specifically, CCP molecules are fused to GST
and this fusion polypeptide is expressed in E. coli, e.g., strain PEB199. Expression of the
GST-CCP fusion protein in PEB199 is induced with IPTG. The recombinant fusion
polypeptide is purified from crude bacterial lysates of the induced PEB199 strain by
affinity chromatography on glutathione beads. Using polyacrylamide gel electrophoretic
analysis of the polypeptide purified from the bacterial lysates, the molecular weight of the
resultant fusion polypeptide is determined.

EXAMPLE 7: EXPRESSION OF RECOMBINANT CCP PROTEINS 

IN COS CELLS 

To express the CCP gene of the present invention in COS cells, the pcDNA/Amp
vector by Invitrogen Corporation (San Diego, CA) is used. This vector contains an SV40
origin of replication, an ampicillin resistance gene, an E. coli replication origin, a CMV
promoter followed by a polylinker region, and an SV40 intron and polyadenylation site. A
DNA fragment encoding the entire CCP protein and an HA tag (Wilson et al. (1984) Cell
37:767) or a FLAG tag fused in-frame to the 3' end of the fragment is cloned into the
polylinker region of the vector, thereby placing the expression of the recombinant protein
under the control of the CMV promoter.

To construct the plasmid, the CCP DNA sequence is amplified by PCR using two
primers. The 5' primer contains the restriction site of interest followed by approximately
twenty nucleotides of the CCP coding sequence starting from the initiation codon; the 3'
end sequence contains complementary sequences to the other restriction site of interest, a
translation stop codon, the HA tag or FLAG tag and the last 20 nucleotides of the CCP
coding sequence. The PCR amplified fragment and the pcDNA/Amp vector are digested
with the appropriate restriction enzymes and the vector is dephosphorylated using the
CIAP enzyme (New England Biolabs, Beverly, MA). Preferably the two restriction sites
chosen are different so that the CCP gene is inserted in the correct
orientation. The ligation mixture is transformed into E. coli cells (strains HB101, DH5α,
SURE, available from Stratagene Cloning Systems, La Jolla, CA, can be used), the
transformed culture is plated on ampicillin media plates, and resistant colonies are
selected. Plasmid DNA is isolated from transformants and examined by restriction
analysis for the presence of the correct fragment.

COS cells are subsequently transfected with the CCP-pcDNA/Amp plasmid DNA
using the calcium phosphate or calcium chloride co-precipitation methods,
DEAE-dextran-mediated transfection, lipofection, or electroporation. Other suitable methods
for transfecting host cells can be found in Sambrook, J., Fritsch, E. F., and Maniatis, T.
Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory,
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. The expression of
the CCP polypeptide is detected by radiolabelling (35S-methionine or 35S-cysteine
available from NEN, Boston, MA, can be used) and immunoprecipitation (Harlow, E. and
Lane, D. Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY, 1988) using an HA-specific monoclonal antibody. Briefly, the cells
are labelled for 8 hours with 35S-methionine (or 35S-cysteine). The culture media are then
collected and the cells are lysed using detergents (RIPA buffer, 150 mM NaCl, 1% NP-40,
0.1% SDS, 0.5% DOC, 50 mM Tris, pH 7.5). Both the cell lysate and the culture media
are precipitated with an HA-specific monoclonal antibody. Precipitated polypeptides are
then analyzed by SDS-PAGE.

Alternatively, DNA containing the CCP coding sequence is cloned directly into the
polylinker of the pcDNA/Amp vector using the appropriate restriction sites. The resulting
plasmid is transfected into COS cells in the manner described above, and the expression of
the CCP polypeptide is detected by radiolabelling and immunoprecipitation using a CCP
specific monoclonal antibody.

EXAMPLE 8: IN VITRO PHOSPHORYLATION OF CDC2bDN-IC26M BY PLANT CDKs

The CDC2bDN-IC26M coding region (SEQ ID NO:4) was amplified by PCR with Pfu polymerase
(Stratagene, La Jolla, CA). The PCR product was subcloned into pET19b (Novagen, Madison,
WI) to obtain CDC2bDN-IC26MpET19b. The CDC2bDN-IC26M gene is located downstream of a
T7lac promoter, in frame with a sequence encoding a 10-histidine tag followed by an
enterokinase recognition site. Escherichia coli BL21(DE3) cells (Novagen) containing the
CDC2bDN-IC26MpET19b plasmid were grown at 37 °C in M9 medium (Sambrook and Russell,
Molecular Cloning: A Laboratory Manual, 3rd Edition, CSHL Press, CSH, New York, 2001),
supplemented with 100 µg/ml of ampicillin, to a cell density corresponding to an A600 of
0.6. Subsequently, expression of the CDC2bDN-IC26M gene was induced by addition of 0.4 mM
isopropyl β-D-thiogalactoside, and culture was continued for 4 h at 30 °C.

Cells were collected in lysis buffer containing 50 mM sodium phosphate buffer, pH 8.0,
300 mM NaCl, 0.1% Triton X-100, and 1 mM phenylmethylsulfonyl fluoride (PMSF) and were
lysed on ice by sonication. The extract was clarified by centrifugation for 20 minutes at
20,000 × g. The crude extract was loaded at 4 °C on a nickel-nitrilotriacetic acid-agarose
affinity resin (Qiagen), and protein fractionation was performed according to the
manufacturer's instructions. The fractions containing the CDC2bDN-IC26M fusion protein
were pooled.

CDC2bDN-IC26M kinase assays were performed with CDK complexes purified from total plant
(Arabidopsis seedlings) protein extracts by p13suc1-Sepharose affinity binding according
to Azzi et al. (Eur. J. Biochem. 203: 353-360). Briefly, p13suc1 was purified from an
overproducing E. coli strain by chromatography in Sephacryl S2000, and conjugated to
CNBr-activated Sepharose 4B (Pharmacia) according to the manufacturer's instructions.
Total plant protein extracts (300 µg) were incubated with 50 µl 50% (v/v)
p13suc1-Sepharose beads for 2 h at 4 °C. The washed beads were combined with 30 µl kinase
buffer containing ~1 mg/ml CDC2bDN-IC26M, 150 mM ATP and 1 µCi of [γ-32P]ATP (Amersham).
After 20 minutes of incubation at 30 °C, samples were analysed by SDS-PAGE and
autoradiographed.

As shown in Figure 48, the purified CDC2bDN-IC26M protein is phosphorylated by CDKs in
vitro.

EXAMPLE 9: PCR AMPLIFICATION OF AtDPb 

Based on available sequence data of putative plant DP-related partial clones from the
databank (soybean DP (AI939068), tomato DP (AW217514), and cotton DP (AI731675)), three
oligonucleotides, corresponding to the most conserved part of the DNA-binding and E2F
heterodimerization domains (MKVCEKV, SEQ ID NO:240; LNVLMAMD, SEQ ID NO:241; and FNSTPFEL,
SEQ ID NO:242), were synthesized and designated A (ATAGAATTCATGAAAGTTTGTGAAAAGGTG, SEQ ID
NO:243), B (ATAGAATTCCTGAATGTTCTCATGGCAATGGAT, SEQ ID NO:244) and C
(ATAGGATCCCAGCTCAAAAGGAGTGCTATTGAA, SEQ ID NO:245), respectively. PCR was performed on an
Arabidopsis/yeast two-hybrid suspension culture cDNA library. The PCR products were
purified, digested with EcoRI and BamHI, and ligated into pCR-XL-TOPO vector (Invitrogen).
The cloned inserts were sequenced by double-stranded dideoxy sequencing.
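
The correspondence between oligonucleotides A, B and C and the three conserved peptide
motifs can be checked with a short script. The sketch below assumes the standard genetic
code and assumes that each oligonucleotide carries a 9-nucleotide 5' tail (ATA followed by
a six-base restriction site); A and B are read on the sense strand, while C is read as the
reverse complement because it is the antisense primer.

```python
# Translate the motif-encoding part of primers A, B and C (standard genetic code).

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons of a DNA sequence, reading from position 0."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


PRIMERS = {
    "A": "ATAGAATTCATGAAAGTTTGTGAAAAGGTG",     # SEQ ID NO:243
    "B": "ATAGAATTCCTGAATGTTCTCATGGCAATGGAT",  # SEQ ID NO:244
    "C": "ATAGGATCCCAGCTCAAAAGGAGTGCTATTGAA",  # SEQ ID NO:245
}
TAIL = 9  # assumed 5' tail: ATA + six-base restriction site

motif_a = translate(PRIMERS["A"][TAIL:])
motif_b = translate(PRIMERS["B"][TAIL:])
motif_c = translate(reverse_complement(PRIMERS["C"][TAIL:]))

print(motif_a, motif_b, motif_c)  # MKVCEKV LNVLMAMD FNSTPFEL
```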


EXAMPLE 10: CONSTRUCTION OF AtDP and AtE2F MUTANTS, IN VITRO
TRANSCRIPTION-TRANSLATION SYSTEM AND IMMUNOPRECIPITATION

Influenza hemagglutinin (HA)-tagged versions of the wild-type and mutant AtE2Fa and AtE2Fb
were constructed by cloning into the pSK plasmid (Stratagene) containing the HA-tag (SEQ
ID NO:202). The AtE2F mutants, namely AtE2Fa 1-420 (SEQ ID NO:217), AtE2Fa 162-485 (SEQ ID
NO:218), and AtE2Fb 1-385 (SEQ ID NO:206), were obtained by PCR and cloned into the EcoRI
and BamHI sites of HA-pSK. The c-myc (SEQ ID NO:200)-tagged versions of wild-type and
AtDP mutants (AtDPa 1-292, SEQ ID NO:114; AtDPa 121-292, SEQ ID NO:211; AtDPa 1-142, SEQ
ID NO:208; AtDPa 172-292, SEQ ID NO:213; AtDPa 121-213, SEQ ID NO:212; and AtDPb 1-385,
SEQ ID NO:127; AtDPb 182-385, SEQ ID NO:216; AtDPb 1-263, SEQ ID NO:223; AtDPb 1-193, SEQ
ID NO:214; and AtDPb 182-263, SEQ ID NO:215) were generated by PCR and cloned into the
EcoRI and PstI sites of the pBluescript plasmid (Stratagene) containing a double c-myc
tag. All cloning steps were carried out according to standard procedures, and the reading
frames were verified by direct sequencing.

In vitro transcription and translation experiments were performed using the TNT T7-coupled
wheat germ extract kit (Promega) primed with appropriate plasmids for 90 min at 30 °C. For
immunoprecipitation, 10 µl of the total in vitro translated extract (50 µl) was diluted
1:5 in Nonidet P40 buffer (50 mM Tris, pH 7.4, 150 mM NaCl, 1% Nonidet P40, 1 mM
phenylmethylsulfonyl fluoride, 10 µg/ml leupeptin/aprotinin/pepstatin) and incubated for
2 h at 4 °C with anti-c-myc (9E10; BabCo) or anti-HA (16B12; BabCo) antibodies.
Protein-A-Sepharose (40 µl 25% (v/v)) was added and incubated for 1 h at 4 °C, then the
beads were washed four times with Nonidet P40 buffer. Immune complexes were eluted with
10 µl 2× sodium dodecyl sulfate (SDS) sample buffer and analyzed by 10% or 15% SDS-PAGE
and by autoradiography.

An overview of the AtDP and AtE2F fragments and their SEQ ID NOs is given in
Table 4.

TABLE IV

CCP or partial CCP     SEQ ID NO                 SEQ ID NO
                       (amino acid sequence)     (DNA sequence)

AtE2Fa 226-356         205                       228
AtE2Fb 1-385           206
AtE2Fb 1-127           207
AtDPa 1-142            208
AtDPa 42-142           209
AtDPa 42-292           210
AtDPa 121-292          211                       229
AtDPa 121-213          212
AtDPa 172-292          213
AtDPb 1-193            214
AtDPb 182-263          215                       230
AtDPb 182-385          216                       231
AtE2Fa 1-420           217
AtE2Fa 162-485         218
AtE2Fa 1-38            219
AtDPa 1-214            220                       239
AtDPa 143-292          221                       232
AtDPa 143-213          222                       233
AtDPb 1-263            223                       234
AtE2Fa 232-282         224                       235
AtE2Fa 232-352         225                       236
AtE2Fb 194-243         226                       237
AtE2Fb 194-311         227                       238


EXAMPLE 11: IN VITRO INTERACTION BETWEEN AtDPs, AtE2Fs AND MUTANTS THEREOF
ILLUSTRATED BY IMMUNOPRECIPITATION EXPERIMENTS

The AtDPa and AtDPb can efficiently interact in vitro with AtE2Fa and AtE2Fb. As a first
step in comparing the biochemical properties of AtDPa and AtDPb, the ability of these
molecules to heterodimerize with AtE2Fa and AtE2Fb was tested. For this purpose, the
coupled in vitro transcription-translation system was used in which the c-myc-tagged AtDPa
or AtDPb was co-expressed with the HA-tagged AtE2Fa or AtE2Fb. One part of each sample was
resolved by SDS-PAGE (Figures 50 and 51, panels A), while another part was subjected to
immunoprecipitation with monoclonal anti-c-myc antibodies (Figures 50 and 51, panels B).
In the absence of DP proteins, no AtE2Fa or AtE2Fb was precipitated by the anti-c-myc
antibodies (Figure 51, panel B, lane 1). However, both HA-AtE2F proteins co-precipitated
reproducibly with c-myc-tagged AtDPa (Figure 50, panel B, lanes 1 and 2) and AtDPb (Figure
51, panel B, lanes 3 and 4). Identical results were obtained in a reciprocal experiment
with anti-HA monoclonal antibodies. These data revealed that both Arabidopsis DP-related
proteins interacted in vitro with the different Arabidopsis E2F-related proteins.

The conserved dimerization domain of the AtE2Fs seemed to be important for the interaction
with the AtDPs, because mutational analysis showed that deletion of neither the N-terminal
extension nor the C-terminal part of AtE2Fa and AtE2Fb impaired the interaction with the
DPs (Figures 50 and 51, panels B). Similar results were obtained by two-hybrid analysis
(see Table 5 of Example 12). To test whether the structural requirements for
heterodimerization of the AtDPs were similar to those of their animal homologs, several
deletion mutants of AtDPa and AtDPb were constructed (for a schematic illustration, see
Figures 52 and 53) and tagged with the c-myc epitope (Figures 54 and 55, panels A). The
interactions between the mutant AtDPs and AtE2Fb were analyzed in immunoprecipitation
experiments with the specific anti-HA or anti-c-myc antibodies (Figures 54 and 55, panels
B and C, respectively). As shown in Figures 54 and 55, mutant AtDP proteins with deleted
DNA-binding domain could bind sufficiently to the co-translated HA-AtE2Fb proteins (Figure
54, panel C, lane 2; and Figure 55, panel C, lane 2). No detectable interaction was found
between the AtE2Fb protein and mutant DP proteins containing the complete DNA-binding
domain, but lacking the putative dimerization domain (Figure 54, panel C, lane 3; Figure
55, panel C, lane 4). Thus, the N-terminal part of both AtDP proteins, including the
conserved DNA-binding domain, was not sufficient for the in vitro interaction to occur. In
contrast, a mutant form of AtDPb (amino acids 1-263; SEQ ID NO:223) could bind to AtE2Fb
(Figure 55, panel C, lane 3), indicating that the region of AtDPb between amino acids 182
and 263 was required for interaction with AtE2Fb.
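
The deletion-mapping reasoning used above for AtDPb can be written down as a small
interval calculation: the candidate interaction region is the stretch shared by every
fragment that binds, and it should not be fully contained in any fragment that fails to
bind. The assignment of fragments to outcomes below is an interpretation of the text
together with the fragment list of Example 10 (182-385 taken as the mutant lacking the
DNA-binding domain, 1-193 as the mutant lacking the putative dimerization domain); it is
an assumption, not a statement of what each gel lane contained.

```python
# Deletion mapping as interval intersection; fragment outcomes are an
# interpretation of the text (see lead-in), not measured data.
from typing import Dict, Optional, Tuple

Span = Tuple[int, int]  # (first residue, last residue), inclusive


def common_region(results: Dict[Span, bool]) -> Optional[Span]:
    """Return the region shared by all binding fragments, or None."""
    binders = [span for span, bound in results.items() if bound]
    if not binders:
        return None
    start = max(first for first, _ in binders)
    end = min(last for _, last in binders)
    return (start, end) if start <= end else None


def consistent(results: Dict[Span, bool], region: Span) -> bool:
    """True if no non-binding fragment contains the whole candidate region."""
    return all(
        not (first <= region[0] and region[1] <= last)
        for (first, last), bound in results.items()
        if not bound
    )


ATDPB_RESULTS: Dict[Span, bool] = {
    (1, 385): True,     # full-length AtDPb binds AtE2Fb
    (182, 385): True,   # DNA-binding domain deleted, still binds
    (1, 193): False,    # lacks putative dimerization domain, no binding
    (1, 263): True,     # binds
}

region = common_region(ATDPB_RESULTS)
print(region, consistent(ATDPB_RESULTS, region))
```

A region found this way is necessary for binding within the tested set; whether it is
also sufficient has to be tested directly with a fragment covering only that region.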

To confirm this hypothesis, a deletion mutant of AtDPb (182-263, SEQ ID NO:215) was
constructed and, as expected, it could bind to AtE2Fb (Figure 56). The requirement for the
homologous dimerization domain of AtDPa for the interaction with AtE2Fb was supported by a
binding assay in which the mutant AtDPa 172-292 (SEQ ID NO:213), with the N-terminal part
of the dimerization domain deleted, failed to bind to AtE2Fb (Figure 54, panels B and C,
lanes 4). However, when the E2F-binding activity of the predicted dimerization domain of
the AtDPa (amino acid positions 121-213, SEQ ID NO:212) was tested, no interaction could
be detected between this region and the AtE2Fb protein (Figure 54, panel B, lane 5). These
data indicate that other carboxyl-terminal regions of AtDPa are required for the stable
interaction with AtE2Fb.

EXAMPLE 12: YEAST TWO-HYBRID EXPERIMENTS FOR SHOWING INTERACTION BETWEEN DP AND E2F
MUTANTS

For library screening, vectors and strains (HF7c) were provided with the Matchmaker
two-hybrid system (Clontech). The dimerization and DNA-binding domains of the AtE2Fa
(amino acids 226-356; SEQ ID NO:205) were amplified by polymerase chain reaction (PCR) and
subcloned in-frame with the GAL4 DNA-binding domain of pGBT9 (Clontech) to create the bait
plasmid pGBTE2Fa226-356. Screens were performed as described previously (De Veylder et al.
1999; Planta 208, 453-462). A second library screening was performed with the AtE2Fb
construct (pGBTE2Fb-Rb) lacking the Rb-binding domain (amino acids 1-385; SEQ ID NO:206).
Plasmids from interacting clones were isolated and sequenced.

For the yeast two-hybrid interaction experiments, a number of yeast two-hybrid prey (in
pAD-GAL424) plasmids were created by PCR amplification of fragments from the AtDPa (DPa
1-292, SEQ ID NO:114; DPa 1-142, SEQ ID NO:208; DPa 42-142, SEQ ID NO:209; DPa 42-292, SEQ
ID NO:210; DPa 121-292, SEQ ID NO:211; DPa 121-213, SEQ ID NO:212; and DPa 172-292, SEQ ID
NO:213) and AtDPb (DPb 1-385, SEQ ID NO:127; DPb 1-193, SEQ ID NO:214; DPb 182-263, SEQ ID
NO:215; and DPb 182-385, SEQ ID NO:216) genes and confirmed by sequencing. Different
combinations between bait (pGBTE2Fa226-356, pGBTE2Fb-Rb, or pGBTE2Fb 1-127, SEQ ID NO:207)
and prey constructs were transformed into yeast cells and assayed for their ability to
grow on His- minimal media after 3 days of incubation at 30 °C. Bait plasmids
co-transformed with empty pAD-GAL424 and prey plasmids co-transformed with empty pGBT9
were assessed alongside as controls for the specificity of the interaction.
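
The layout of such an interaction experiment can be enumerated programmatically. The
sketch below lists every bait and prey pairing together with the two kinds of
empty-vector controls named above. It assumes, purely for illustration, that all pairings
are tested; the text only states that different combinations were used.

```python
# Enumerate bait x prey co-transformations plus empty-vector controls.
# Assumes (for illustration only) that every pairing is tested.
from itertools import product

BAITS = ["pGBTE2Fa226-356", "pGBTE2Fb-Rb", "pGBTE2Fb 1-127"]
PREYS = [
    "DPa 1-292", "DPa 1-142", "DPa 42-142", "DPa 42-292",
    "DPa 121-292", "DPa 121-213", "DPa 172-292",
    "DPb 1-385", "DPb 1-193", "DPb 182-263", "DPb 182-385",
]
EMPTY_BAIT = "pGBT9"        # empty bait vector
EMPTY_PREY = "pAD-GAL424"   # empty prey vector


def transformation_plan(baits, preys):
    """Return (test pairings, control pairings) as lists of (bait, prey)."""
    tests = list(product(baits, preys))
    controls = [(bait, EMPTY_PREY) for bait in baits]
    controls += [(EMPTY_BAIT, prey) for prey in preys]
    return tests, controls


tests, controls = transformation_plan(BAITS, PREYS)
print(len(tests), "test pairings;", len(controls), "control pairings")
for bait, prey in controls[:3]:
    print("control:", bait, "+", prey)
```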

An overview of the AtDP and AtE2F fragments and their SEQ ID NOs is given in Table 4.

The results obtained were confirmed by two-hybrid interaction analysis. pGBTE2Fa226-356
and pGBTE2Fb-Rb were co-transformed in an appropriate yeast reporter strain with a plasmid
producing the full-length AtDPa or AtDPb protein fused to the GAL4 transactivation domain.
The specific reconstitution of GAL4-dependent gene expression, measured as the ability to
grow in the absence of histidine, confirms the interaction between the two DP and E2F
proteins (Table 5).

EXAMPLE 13: RNA ISOLATION AND REVERSE TRANSCRIPTION (RT)-PCR ANALYSIS OF AtDP AND
AtE2F EXPRESSION

A. thaliana (L.) Heynh. cell suspension cultures were maintained as described previously
(Glab et al. 1994, FEBS Lett. 17, 207-211). The cells were partially synchronized by the
consecutive addition of aphidicolin (5 µg/ml) and propyzamide (1.54 µg/ml). The
aphidicolin block was left for 24 hours. The cells were washed for 1 hour in B5 medium
before the addition of propyzamide. Samples were taken at the end of the 24 hour
aphidicolin block, at the end of a 1 hour washing step, and at 1, 2, 3, and 4 hours after
the addition of propyzamide to the culture medium. Total RNA was isolated from the
Arabidopsis cell suspension culture according to Magyar et al. (1997), Plant Cell 9,
223-235, and from different organs with the TRIzol reagent (Gibco/BRL). Semi-quantitative
RT-PCR amplification was carried out on reverse-transcribed mRNA, ensuring that the amount
of amplified product stayed in linear proportion to the initial template present in the
reaction. 10 µl from the PCR was transferred onto Hybond-N+ membrane, hybridized to
fluorescein-labeled gene-specific probes (Gene-Images random prime labeling module;
Amersham Pharmacia Biotech), detected with the CDP-Star detection module (Amersham), and
visualized by short exposure to Kodak X-OMAT autoradiography film. The following primer
pairs (forward and reverse) were used for the amplification:
5'-ATAGAATTCATGTCCGGTGTCGTACGA-3' (SEQ ID NO:249, containing an EcoRI site) and
5'-ATAGGATCCCACCTCCAATGTTTCTGCAGC-3' (SEQ ID NO:250, containing a BamHI site) for AtE2Fa
(GenBank accession number AJ294533);
5'-ATAGAATTCGAGAAGAAAGGGCAATCAAGA-3' (SEQ ID NO:251, containing an EcoRI site) and
5'-ATACTGCAGAGAAATCTCGATTTCGACTAC-3' (SEQ ID NO:252, containing a PstI site) for AtDPa
(GenBank accession number AJ294531);
5'-GCCACTCTCATAGGGTTCTCCATCG-3' (SEQ ID NO:253) and 5'-GGCATGCCTCCAAGATCCTTGAAGT-3' (SEQ
ID NO:254) for Arath;CDKA;1 (GenBank accession number X57839);
5'-GGGTCTTGGTCGTTTTACTGTT-3' (SEQ ID NO:255) and 5'-CCAAGACGATGACAACAGATACAGC-3' (SEQ ID
NO:256) for Arath;CDKB1;1 (GenBank accession number X57840);
5'-ATAAACTAAATCTTCGCTGAA-3' (SEQ ID NO:257) and 5'-CAAACGCGGATCTGAAAAACT-3' (SEQ ID
NO:258) for histone H4 (GenBank accession number M17132);
5'-TCTCTCTTCCAAATCTCC-3' (SEQ ID NO:259) and 5'-AAGTCTCTCACTTTCTCACT-3' (SEQ ID NO:260)
for ROC5 (AtCYP1, GenBank accession number U072676) (Chou and Gasser 1997, Plant Mol.
Biol. 35, 873-892);
5'-CTAAGCTCTCAAGATCAAAGGCTTA-3' (SEQ ID NO:261) and 5'-TTAACATTGCAAAGAGTTTCAAGGT-3' (SEQ
ID NO:262) for actin 2 gene (GenBank accession number U41998) (An et al. 1996, Plant J.
10, 107-121).
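
Since the restriction sites in the AtE2Fa and AtDPa primers are marked only by formatting,
their position can be confirmed by searching each primer for the enzyme's recognition
sequence. The sketch below assumes the usual recognition sequences for EcoRI (GAATTC),
BamHI (GGATCC) and PstI (CTGCAG) and reports the zero-based index at which each site
starts.

```python
# Locate the stated restriction site in each cloning primer.
# Recognition sequences are the commonly cited ones for these enzymes.

RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "PstI": "CTGCAG",
}

# SEQ ID NO -> (primer sequence 5'->3', enzyme named in the text)
CLONING_PRIMERS = {
    249: ("ATAGAATTCATGTCCGGTGTCGTACGA", "EcoRI"),
    250: ("ATAGGATCCCACCTCCAATGTTTCTGCAGC", "BamHI"),
    251: ("ATAGAATTCGAGAAGAAAGGGCAATCAAGA", "EcoRI"),
    252: ("ATACTGCAGAGAAATCTCGATTTCGACTAC", "PstI"),
}


def site_position(primer: str, enzyme: str) -> int:
    """Return the 0-based start of the enzyme's site in primer, or -1."""
    return primer.upper().find(RECOGNITION_SITES[enzyme])


positions = {
    seq_id: site_position(sequence, enzyme)
    for seq_id, (sequence, enzyme) in CLONING_PRIMERS.items()
}

for seq_id, index in positions.items():
    enzyme = CLONING_PRIMERS[seq_id][1]
    print(f"SEQ ID NO:{seq_id}: {enzyme} site starts at index {index}")
```

In all four primers the site follows a three-base ATA tail, which matches the layout of
oligonucleotides A, B and C in Example 9.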
EXAMPLE 14: THE AtDPa AND THE AtE2Fa GENES ARE CO-EXPRESSED IN A CELL CYCLE
PHASE-DEPENDENT MANNER

The identification of the AtDPa in a yeast two-hybrid screen as a gene encoding an
AtE2Fa-associating protein indicated that the two proteins might act cooperatively in
plant cells as a functional heterodimer. To strengthen this hypothesis, we investigated
whether both genes were co-regulated at the transcriptional level. Tissue-specific
expression analysis revealed that both genes were clearly up-regulated in flowers and were
very strongly transcribed in actively dividing cell suspension cultures (Figure 57).
Expression in these tissues could be a sign of the correlation between the actual
proliferation activity of a given tissue and the transcript accumulation, as can be seen
from the Arath;CDKB1;1 gene. AtDPa transcripts were also detectable in leaf and, to a
lesser extent, in root and stem tissues, whereas AtE2Fa transcripts were virtually
undetectable in roots and stem with only slight levels of expression in leaf tissues. Cell
cycle phase-dependent gene transcription was studied using an Arabidopsis cell suspension
that was partially synchronized by the sequential treatment with aphidicolin and
propyzamide. The Arabidopsis histone H4 and the Arath;CDKB1;1 gene were included to
monitor the cell cycle progression (Figure 58) (Chaubet et al. 1996, Plant J. 10, 425-435;
Segers et al. 1996, Plant J. 10, 601-612). Bearing in mind the partial synchronization of
the culture, it can be observed that histone H4 transcript levels peaked immediately after
the removal of the inhibitor and decreased gradually thereafter (Figure 58). The opposite
expression pattern could be observed for the Arath;CDKB1;1 gene, illustrating that cells
entered the G2-M phases with partial synchrony. Within this experimental setting, the
AtDPa and the AtE2Fa genes show a very similar expression pattern. Both exhibit higher
transcript accumulation before the peak of histone H4 gene expression and quickly decay in
the following samples (Figure 58). The similarity in the expression patterns of
Arabidopsis AtDPa and AtE2Fa supports the possibility that they act cooperatively as a
heterodimer during the S phase.

EXAMPLE 15: TRANSFORMATION OF ARABIDOPSIS THALIANA WITH CaMV35S::DPa

Arabidopsis plants were transformed (using the in planta flower dip method; Clough and
Bent, Plant J. 16:735-743, 1998) with a construct containing the DPa gene under the
control of the CaMV 35S promoter. The lines were molecularly analysed by northern
blotting. As can be seen in Figure 59, all lines showed increased DPa levels in comparison
with the untransformed control. Generally, two classes of lines were observed: weakly
expressing (e.g., 16) and strongly expressing (e.g., 23) lines (see Figure 59). The plants
are subsequently analyzed for phenotypic alterations as described herein.

Equivalents

Those skilled in the art will recognize, or be able to ascertain using no more than
routine experimentation, many equivalents to the specific embodiments of the invention
described herein. Such equivalents are intended to be encompassed by the following
claims.

What is claimed:

1. An isolated nucleic acid molecule selected from the group consisting of:

(a) a nucleic acid molecule comprising the nucleotide sequence set forth in SEQ ID NOs:3,
6, 12, 13, 29, 41, 42, or 45.

2. An isolated nucleic acid molecule which encodes a polypeptide comprising the amino acid
sequence set forth in SEQ ID NOs:69, 72, 78, 79, 95, 108, or 111.

3. An isolated nucleic acid molecule which encodes a naturally occurring allelic variant
of a polypeptide comprising the amino acid sequence set forth in SEQ ID NOs:69, 72, 78,
79, 95, 108, or 111.

4. An isolated nucleic acid molecule selected from the group consisting of:

a) a nucleic acid molecule comprising a nucleotide sequence which is at least 60%
identical to the nucleotide sequence of SEQ ID NOs:3, 6, 12, 13, 29, 41, 42, or 45, or a
complement thereof;

b) a nucleic acid molecule comprising a fragment of at least 50 nucleotides of a nucleic
acid comprising the nucleotide sequence of SEQ ID NOs:3, 6, 12, 13, 29, 41, 42, or 45, or
a complement thereof;

c) a nucleic acid molecule which encodes a polypeptide comprising an amino acid sequence
at least about 60% identical to the amino acid sequence of SEQ ID NOs:69, 72, 78, 79, 95,
108, or 111; and

d) a nucleic acid molecule which encodes a fragment of a polypeptide comprising the amino
acid sequence of SEQ ID NOs:69, 72, 78, 79, 95, 108, or 111, wherein the fragment
comprises at least 15 contiguous amino acid residues of the amino acid sequence of SEQ ID
NOs:69, 72, 78, 79, 95, 108, or 111.

5. An isolated nucleic acid molecule which hybridizes to the nucleic acid molecule of any
one of claims 1, 2, 3, or 4 under stringent conditions.

6. An isolated nucleic acid molecule comprising a nucleotide sequence which is
complementary to the nucleotide sequence of the nucleic acid molecule of any one of claims
1, 2, 3, or 4.

7. An isolated nucleic acid molecule comprising the nucleic acid molecule of any one of
claims 1, 2, 3, 4, or 5, and a nucleotide sequence encoding a heterologous polypeptide.

8. A vector comprising the nucleic acid molecule of any one of claims 1, 2, 3, or 4.

9. A cell comprising the nucleic acid molecule of any one of claims 1, 2, 3, or 4.

10. A host cell transfected with the vector of claim 8.

11. A method of producing a polypeptide comprising culturing the host cell of claim 10 in
an appropriate culture medium to, thereby, produce the polypeptide.

12. An isolated polypeptide selected from the group consisting of:

a) a fragment of a polypeptide comprising the amino acid sequence of SEQ ID NOs:69, 72,
78, 79, 95, 108, or 111, wherein the fragment comprises at least 15 contiguous amino acids
of SEQ ID NOs:69, 72, 78, 79, 95, 108, or 111;

b) a naturally occurring allelic variant of a polypeptide comprising the amino acid
sequence of SEQ ID NOs:69, 72, 78, 79, 95, 108, or 111, wherein the polypeptide is encoded
by a nucleic acid molecule which hybridizes to a nucleic acid molecule consisting of SEQ
ID NOs:3, 6, 12, 13, 29, 41, 42, or 45 under stringent conditions;

c) a polypeptide which is encoded by a nucleic acid molecule comprising a nucleotide
sequence which is at least 60% identical to a nucleic acid comprising the nucleotide
sequence of SEQ ID NOs:3, 6, 12, 13, 29, 41, 42, or 45;

d) a polypeptide comprising an amino acid sequence which is at least 60% identical to the
amino acid sequence of SEQ ID NOs:69, 72, 78, 79, 95, 108, or 111.

13. The isolated polypeptide of claim 12 comprising the amino acid sequence of SEQ ID
NOs:69, 72, 78, 79, 95, 108, or 111.

14. The polypeptide of claim 12, further comprising heterologous amino acid sequences.

15. An antibody which selectively binds to a polypeptide of claim 12.




16. A method for detecting the presence of a polypeptide of claim 12 in a sample
comprising:

a) contacting the sample with a compound which selectively binds to the polypeptide; and

b) determining whether the compound binds to the polypeptide in the sample to thereby
detect the presence of a polypeptide of claim 12 in the sample.

17. The method of claim 16, wherein the compound which binds to the 
polypeptide is an antibody. 

18. A kit comprising a compound which selectively binds to a polypeptide of 
claim 12 and instructions for use. 

19. A method for detecting the presence of a nucleic acid molecule of any one 
of claims 1, 2, 3, or 4 in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a 
nucleic acid molecule in the sample to thereby detect the presence of a nucleic acid 
molecule of any one of claims 1, 2, 3, or 4 in the sample. 

20. The method of claim 19, wherein the sample comprises mRNA molecules 
and is contacted with a nucleic acid probe. 

21. A kit comprising a compound which selectively hybridizes to a nucleic acid 
molecule of any one of claims 1, 2, 3, or 4 and instructions for use. 

22. A method for identifying a compound which binds to a polypeptide of claim 12
comprising:

a) contacting the polypeptide, or a cell expressing the polypeptide with a test compound;
and

b) determining whether the polypeptide binds to the test compound.

23. The method of claim 22, wherein the binding of the test compound to the polypeptide is
detected by a method selected from the group consisting of:

a) detection of binding by direct detection of test compound/polypeptide binding;

b) detection of binding using a competition binding assay; and

c) detection of binding using an assay for CCP activity.

24. A method for modulating the activity of a polypeptide of claim 12 comprising
contacting the polypeptide or a cell expressing the polypeptide with a compound which
binds to the polypeptide in a sufficient concentration to modulate the activity of the
polypeptide.

25. A method for identifying a compound which modulates the activity of a polypeptide of
claim 12 comprising:

a) contacting a polypeptide of claim 12 with a test compound; and

b) determining the effect of the test compound on the activity of the polypeptide to
thereby identify a compound which modulates the activity of the polypeptide.

26. A transgenic plant comprising the nucleic acid molecule of any one of claims 1, 2, 3,
or 4.

27. The transgenic plant of claim 26, wherein the plant is a monocot plant.

28. The transgenic plant of claim 26, wherein the plant is a dicot plant.

29. The transgenic plant of claim 26, wherein the plant is selected from the group
consisting of Arabidopsis thaliana, rice, wheat, maize, tomato, alfalfa, oilseed rape,
soybean, sunflower, and canola.

30. A method for modulating the growth of a plant, comprising introducing into the plant a
CCP modulator in an amount sufficient to modulate the growth of the plant, thereby
modulating the growth of the plant.

31. The method of claim 30, wherein the CCP modulator is a small molecule.

32. The method of claim 30, wherein the CCP modulator is capable of modulating CCP
polypeptide activity.

33. The method of claim 32, wherein the CCP modulator is an anti-CCP antibody.

34. The method of claim 32, wherein the CCP modulator is a CCP polypeptide comprising the
amino acid sequence of SEQ ID NOs: 67-132, 205, 211, 215-216 or 220-227, or a fragment
thereof.

35. The method of claim 30, wherein the CCP modulator is capable of modulating CCP nucleic
acid expression.

36. The method of claim 35, wherein the CCP modulator is an antisense CCP nucleic acid
molecule.

37. The method of claim 35, wherein the CCP modulator is a ribozyme.

38. The method of claim 35, wherein the CCP modulator comprises the nucleotide sequence of
SEQ ID NOs: 1-66 or 228-239, or a fragment thereof.

39. The method of claim 30, wherein the plant is a monocot plant.

40. The method of claim 30, wherein the plant is a dicot plant.

41. The method of claim 30, wherein the plant is selected from the group consisting of
Arabidopsis thaliana, rice, wheat, maize, tomato, alfalfa, oilseed rape, soybean,
sunflower, and canola.

42. A method for modulating the cell cycle in a plant, comprising introducing into the
plant a CCP modulator in an amount sufficient to modulate the cell cycle in the plant,
thereby modulating the cell cycle in the plant.

43. The method of claim 42, wherein the plant is a monocot plant.

44. The method of claim 42, wherein the plant is a dicot plant.

45. The method of claim 42, wherein the plant is selected from the group consisting of
Arabidopsis thaliana, rice, wheat, maize, tomato, alfalfa, oilseed rape, soybean,
sunflower, and canola.

A.

CCP molecule: CCP1 nucleotide sequence (CDC2bDN-IC19): 

cttttaagttgggggatgtttcgattttgaaattt g^ 

ataaataacttccgcagataacgaagaagaagaaa|S2fe t ^? a ^ c ^^^^~^S ccttgga 
ttaatcggatcaatgagtctccaaggtaccctaaatcgatcgattttgttattaaaaatcaaaac 

tttcgttctctttgatttttcccccaaattgattttgaatttacttgatgtagggggaggagtag 

taggaaagatcaagacgacggcaacaacaggaccgacaagaagagcactaagtactattaacaag 

aacatcactgaagcgccgtcttacccttatgctgtcaacaagagatcagtttctgaaagagatgg 

catttgtaataaaccacctgtgcatcgaccagttactaggaagtttgctgctcagttagcagatc 

ataa accacatatccatgatgaggaaactaagaaaccaqactcagtttcaagt aaacraaccagag 

acgattatcattgatqtggatgaaagtga-taaagaaggaggtgactctaatga qccaatgtttgt: 

acaacatactgaagcaat.gctggaggagattgaacagatqqaqaaggaqattgaaatggaagatg 

caqacaaagaagaagagcctgtgatcgatatitqatgcctqtgataaqaataatcc tttggctgcg 

qttqaatatatccatgatatgcataccttctacaaqaattttqagaaacttagttgc qtgcctcc 

taactatatggacaatcaacaagatcttaatgagagaatqagaggaatcctca ttgactggttaa 

ttgaggtgcactacaaqtttgaactgatggaqgaaactctttatctcacaatcaa tqtcatcgac 

agattccttgcggttcatcaaatcgtgaggaaaaagcttcagcttqttggtgt tactgctttgtt 

qcttgcatqtaaatiatqaagaagtttcagttccaqtqqtaqatqatctcatcttgatctctgaca 

aagcttactctagaagagaagtgctagatatqgagaagctaatggccaacacctt qcaattcaat 

ttctctctaccaactccatatgttttcat:gaaacgatttct:caaaqctgccca at:ctgacaagaa 

gcttgagattttatcattctttatgatcgagctt-tgccttq-tggagtatqagatq ctaqagtatc 

ttccatctaagctggcgqcctcagcaatctacactgctcaqtqtacacttaaq qqatttgaagaa 

tqqagcaaaacctgtgaqtttcacacaggctacaacgaaaaacagctactggc atqtqcgagaaa 

qatggttgctttccatcacaaggcaggaacagggaaqctcacaqgagttca ca gaaag tacaaca 

catcta^ 

-tctaatatqacttaactcgagtttttctttagaacaaaaaqagtgtqaqagaaa qaqagatagta 
gagcaagttgcccaaaatgggagaagaatigqatctttaqatatcatqgcaag taqcccaaaaaga 
qtgtattcttctctttctaaggtctttaga-bctttcttcacttgagaqaqaata aaaaqaatctt 

ctgaaaaaaaaaaaaaaaaaaaaaa 



B. 

CCP molecule: CCP1 amino acid sequence (CDC2bDN-IC19): 

MVRSDE NSLGL IGSMSLQGTLNRSILLLKIKTFVLFDFSPKLILNLLDVGGGWGKIKTTATTGP 
T ir^VA^ff^ TTF.AP^YPYAVNKRSVSERDGICNKPPVHRPVTRKFAAQL^ 

PDSVSSEEPETIIIDVDESDKEGGDSNEPMFVQHTEAMLEEIEQMEKEIEMEDADKEEEPVIDID 
ACDKNNPLAAVEYIHDMHTFYKNFEKLSCVPPNYMDNQQDLNERMRGXLIDmilEVHYKFELMEE 
TLYLT I N VI DRFL AVHQ I VRKKLQL VG VT ALL LACKYEEVSVPWD DLILIS DKAYS RREVL DME 
KLMANTLQFN FS LPT P YVFMKRFLKAAQS DKKLE I L S FFMI ELCLVE YEMLE YLPSK LAAS AI YT 
AQCTLKGFEEWSKTCEFHTGYNEKQLLACARKMVAFHHKAGTGKLTGVHRKYNTSKFCHAARTEP 

AGFLI 



FIGURE 1

CCP molecule: CCP2 nucleotide sequence (CDC2bDN-IC20):

aacccacgtcaattctttttcaaaggcatatattctctct^ 

tccttctctgatcgttcgttttctggacgagagaggB^^ 

gattcgqgtactgctgctggtgggtcaaa^ 

cgttgaiigatgatccaacttgtctcatgatcttagagag.gatgcttatgacttqtc'Lctacagag 
taactaaatgtaacagagcagagagcgcattgtctctgcttcggaagaacaagaatggttttgat 
attgtGattagtgatgttcatatgcctgacatggatggtttcaagctccttgaacacgttggttt 
agagatggatttacctgttatcatgatgtctgcggatgattcgaagagcgttgtgttgaaaggag 
tgactcacggtgcagttgattacctcatcaaaccggtacgtattgaggctttgaagaatatatgg 
caacatg.tggtgcggaagaagcgtaac- g agtggaatg t ttctgaacattctqqaggaagtattg 
in CDC2bDN-IC20:c £ 

aagatactggcgg tgacagggacaggcagcagcagca-tagggaggatgctgataacaactcgtct 
tcagttaatgaagggaacgggaggagctcgaggaagcggaaggaagaggaagtagatgatcaagg 
ggatgataaggaagactcatcgagtttaaagaaaccacgcgtggtttggtctgttgaattgcatc 
agcagtttgttgctgctgtgaatcagctaggcgttgacaaagctgttcctaagaagatcttagag 
atgatgaatgtacccgggctaacgcgagaaaacgtagccagtcacctccagaagtatcggatata 
tctgagacggcttggaggagtatcgcaacaccaaggaaatatgaaccattcgtttatgactggtc 
aagatcagagttttggacctctttcttcgttgaatggatttgatcttcaatctttagctgttact 
ggtcagctccctcctcagagccttgcacagcttcaagcagctggtcttggccggcctacactcgc 
taaaccagggatgtcggtttctccccttgtagatcagagaagcatcttcaactttgaaaacccaa 
aaataagatttggagacggacatggtcagacgatgaacaatggaaatttgcttcatggtgtccca 
acgggtagtcacatgcgtctgcgtcctggacagaatgttcagagcagcggaatgatgttgccagt 
agcagaccagctacctcgaggaggaccatcgatgctaccatccctcgggcaacagccgatattgt 
caagcagcgtttcaagaagaagcgatctcactggtgcgctggcggttagaaacagtatccccgag 
accaacagcagagtgttaccaactactcactcggtcttcaataacttccccgcggatctacctcg 
cagcagcttcccgttggcaagtgccccagggatttcagttccagtatcagtlitcttaccaagaag 
aggtcaacagctcggatgcaaaaggaggttcatcagctgctactgctggat-ttggtaacccaagc 
tacgacatatttaacgattttccgcagcaccaacagcacaacaagaacatcagcaataaactaaa 
cgattgggatctgcggaatatgggattggtcttcagttccaatcaggacgcagcaactgcaaccg 
caaccgcagcattttccacttcggaagcatactcttcgtcttctacgcagagaaaaagacgggaa 
acggacgcaacagttgtgggtgagcatgggcagaacctgcagtcaccgagccggaatctgtatca 
tctgaaccacgtttt.tatggacggtggttcagtcagagtgaagtcagaaagagtggcggagacag 
tgacttgtcctccagcaaatacattgtttcacgagcagtataatcaagaagatctgatgagcgca 
tttctcaaacagg aag gcatcccatccgtagataacgagttcgaatttgacggatactccatcga 
taafeatccaggtcffiBc-bacagaaactcagactagactgcaagattctttgtttttcttctccct 
ccttcgaggtacaaagctcaaaacatggcaataaccgaagggaaagataga 
CCP molecule: CCP2 amino acid sequence (CDC2bDN-IC20):

MVNPGHGR GPDSGTAAGGSNSDPFPANLRVLWDDDPTCLMILERMLMTCLYRVTKCNRAESALS 
LLRKNKNGFDIVISDVHMPDMDGFKLLEHVGLEMDLPVIMMSADDSKSWLKGVTHGAVDYLIKP 
VRIEALKNIWQHVVRKKRNEW NVSEHSGGSIEDTGGDRDRQQQHREDADlSrNSSSVNEGMGRSSRK 
RKEEEVDDQGDDKEDSSSLKKPRWWSVELHQQFVAAVNQLGVDKAVPKKILEMMNVPGLTRENV 
ASHLQKYRIYLRRLGGVSQHQGNMNHSFMTGQDQSFGPLSSLNGFDLQSLAVTGQLPPQSLAQLQ 
AAGLGRPTLAKPGMSVSPLVDQRSIFNFENPKIRFGDGHGQTMNNGNLLHGVPTGSHMRLRPGQN 
VQSSGMMLPVADQLPRGGPSMLPSLGQQPILSSSVSRRSDLTGALAVRNSIPETNSRVLPTTHSV 
FNNFPADLPRSSFPLASAPGISVPVSVSYQEEVNSSDAKGGSSAATAGFGNPSYDIFNDFPQHQQ 
HNKNISNKLNDWDLRNMGLVFSSNQDAATATATAAFSTSEAYSSSSTQRKRRETDATWGEHGQN 
LQSPSRNLYHLNHVFMDGGSVRVKSERVAETVTCPPANTLFHEQYNQEDLMSAFLKQEGIPSVDN 

EFEFDGYSIDNIQV 
A.

CCP molecule: CCP3 nucleotide sequence (CDC2bDN-IC21):

a ggctgt 1 t 1 a t cgt nnn^ tttt t^a cgBficrCTQaaggaaakfcgct^tigt ct cggccattcac 
'tcgttcccttgc Ct ^ 

taaacacaaaaagaccagcctt.qqagqat:acaagagccactggaccGaacaaqaggaagaagcga 
gcggttctaggggagatcacaaatgttaactccaatacagctatacttgaqqccaaaaacagcaa 
gcagataaagaaaggacgcggtcatggattggcgagtacatcccagttqgcaacttctgttactt 
cagaagtcacagatcttcagtccaggaccgatgcaaaagttgaagttgcatcaaatacagcagqa 
aacctttctgtttctaaaggcacagataacacagctgataactg-battgagatatggaattctag 
attgcctccaagacctcttgggagatcagcttctacagctgagaaaagtgctgttattggtagtt 
caactgtaccggatatcccaaaatttgtagacatcgattcagatqacaaggatcctttactgtgc 
tgGCtcta-bgcccctgaaatccactacaatttgcqtgtttcagaqcttaaacgcagaccactticc 
ggactttatggagagaatacagaaggatgtcacccagtccatgcggggaattctggttgattggc 
ttgtggaggtctctqaagaatacacacttgcatctqacactctc-tacctcacaqtqtatctcata 
gaGtggttcctCGatggaaactacgtgcaaagacagcaacttcaac-tqctcggcatiGacttgcat 
gctaattgcctGgaagtatgaggaaatctctgctccacgcattgaggagttttgcttcattacgg 
ataacaGctacacaagagatcaggtcctggaaatggagaaccaagtacttaagcattttagcttt 
caaatatacac-bccGactccaaaaacgttccttaqgagatttctcaqaqcagctcaagcc'tGtcg 
CGtgagcccaagccttgaagtcqagtttctagGGagctatctaacagaqttqacat:taatagact 
acGatttcttaaagtttcttcctt-ccgttgttgctgcttcagcgg t ttttctcgccaag-bgqaca 

g (in CDC2bDN-IC21)
atggaccaatcaaaccacccatggaatccaacacttgagcattacacaacgtacaaagcatcgga 
tctgaaagcatctgttcatgGCttacaagatctgcagcttaacaccaaaggttgccccttgagcg 
ctatacgcatgaagtataggcaagagaaatacaaatctgtggcqgttctcacgtctccaaa^eta 



B. 

CCP molecule: CCP3 amino acid sequence (CDC2bDN-IC21): 

MGKENAVSRPFTRSLASALRASEVTSTTQNQQRVNTKRPALEDTRATGPNKRKKj^ ^^^^w^VN 
SNTAILEAKNSKQIKKGRGHGLASTSQLATSVTSEVTDLQSRTDAKVEVASNTAGNLSVSKGTDN 
TADNCIEIWNSRLPPRPLGRSASTAEKSAVIG5STVPDIPKFVDIDSDDKDPLLCCLYAPEIHYN 
LRVSELKRRPLPDFMERIQKDVTQSiM^ 

RQQLQLLGITCMLIASCTEErSAPRIEEFCFITDNTYTRDQVLEMENQVLKHFSFQIYTPTPKTF 
LRRFLRAAQASRLSPSLEVEFLAS YLTELTLIDYHFLKFLPSVVAASAVFLAKWTMDQSNHPWNP 
TLEHYTTYKASDLKASVHALQDLQLNTKGCPLSAIRMKYRQEKYKSVAVLTSPKLLDTLF 
A.

CCP molecule: CCP5 nucleotide sequence (CDC2bDN-IC39): 
ggcacgagaaaaaaaaE2§g^^actc^ 

cgattcttcaagatgaaacaagaagtagaaaattcggacaagagatgaagagggagaagagaaga 
gtgttgcgtgtgattaaccagaatctcgctggtgcaagagtttatccttgtgttgtcaacaagaa 
aggaagcttattgtctaataagcaagaagaagaagaaggatgtcaaaagaagaagtttgattctt 
tgcgtccttcagttacaagatctggagttgaggaagagactaacaagaagctgaagccctcagtt 
ccaagtgctaacgacttcggtgattgtatatttattgatgaggaggaagctacattggaccttcc 
aatgccaatgtcgcttgagaaaccatacattgaagctgatccaatggaagaagttgagatggagg 
atgtaacagtggaagaaccgatcgtggatatcgatgtcttagactcgaagaactcgcttgcggct 
gttgaatatgttcaagatctttacgcattttacagaacaatggagagat-ttagttgtgttccagt 
agactatatgatgcaacaaatcgacttaaacgagaagatgagagcaatactaatcgactggttaa 
tcgaggtacatgacaagtttgatctgatgaacgagacactgtttctgacagtgaatctgatagat 
agattcttgtccaagcaaaatgttatgagaaagaagcttcagcttgtagggttagtagctttgct 
gttagcttgtaagtatgaggaggtttcggttcctgttgtcgaagatttagtactcatttcggaca 
aagcgtatacgaggaacgatgttctagagatggagaaaacaatgttgagtactttgcaattcaat 
atctcgttaccgacacaatacccgttcttgaaaagattcctcaaggcagctcaagcagacaagaa 
gtgtgaggtcttggcgtcgttcttgatcgagcttgcccttgtggagtacgagatgcttcggtttc 
caccatcattactagctgccacatctgtgtacactgctcaatgtacacttgatggttccaggaaa 
tggaacagtacatgtgaattccattgtcattactctgaagaccagctcatggaatgttcacggaa 
gctggtgagtctgcatcagagggcggcgacaggaaacttaacaggagtatataggaagtacagca 
caagcaaatttggttacatagcaaaatgtgaagctgcacactttctag 

^ct^^tec^aaaggacagtagtaagtagtttgtacagcttcctgacatagttccctcattcact 
ctgtagcacaaataagaagaaacaaaaaaaaaagccaattaaatttgtcttatgattgattctgt 
ttttttgttgttactctttgttcacttcacttgtcagtattaaactctacaatgaatgataaatg 
attgaatcatttcattctttgttcagaatgaaatgtattttgtatcttatttgagctaaaaaaaa 
aaaaaaaaaaaaaactcgaggggggccccggtaccc 



B. 

CCP molecule: CCP5 amino acid sequence (CDC2bDN-IC39):

MVNS CENKI FVKPTS TT I LQDETRS RKFGQEMKREKR{^2223SSS LAGARVYPC VVNKKGSLLS 
NKQEEEEGCQKKKFDSLRPSVTRSGVEEETNKKLKPSVPSANDFGDCIFIDEEEATLDLPMPMSL 
EKPYIEADP^EVEMEDVTV^EPIVDIDVLDSKNSLAAVEWQDLYAFYRTMERFSCVPVDY^ 

SFLIELALVEYEHLRFPPSLLAATS^ 

QRAATGNLTG VYRKYS T S KFG YI AKCE AAH FL VS E S HH S 
A.

CCP molecule: CCP4 nucleotide sequence (CDC2bDN-IC26M):

tttatgttgggattgcgacggtaaagttcacggcgctaatttcttggtagctaaacacacgcgtt 
gtcttctctgtagcgcttgtcagtctcttacgccgtggaaagctactgggcttcgtcttggccca 
actttctccgtctgcgagtcatgcgtcgctcttaaaaacgccggcggtggccgtggaaacagagt 
tttatcggagaatcgtggtcaggaggaggttaatagtttcgagtccgaagaagatcggatitagag 
aagatcacggtgacggtgacgacgcggagtcttacgatgatgatgaggaagaagatgaggatgaa 
gagtacagcgacgatgaggatgaggatgatgatgaggatggtgatgatgaggaagcggagaatca 
agttgtgccgtggtctgcggcggcgcaagttcctcGggtgatgagttcttcatcttctgaGggag 
gaagcggaggttcagtgacgaagaggacgagggctagagagaattcagatcttctctgctccgat 
gatgagatcggaagctctt cag ctcaagggtcaaactattctcggccgttgaagcgatcggcgtt 
taaatcaadgffil^^ 

cgtcttctccgatctttgcgatctccaaaacaagaagagatctcagccgttgattcc 



B. 

CCP molecule: CCP4 amino acid sequence (CDC2bDN-IC26M): 

MGKKCDLCNGVARMYCESDQASLCWDCDGKVHG^ 

TFSVCESCVALKNAGGGRGNRVLSENRGQEEVNSFESEEDRIREDHGDGDDAESYDDDEEEDEDE 
EYSDDEDEDDDEDGDDEEAENQWPWSAAAQVPPVMSSSSSDGGSGGSVTKRTRARENSDLLCSD 
DEIGSSSAQGSNYSRPLKRSAFKSTVW 
CCP molecule: CCP6 nucleotide sequence (CDC2bDN-IC57):

atttgagaggaagctttattttgtgtgtag33^ c 5^ 

gcattttgttcctgcagcttcacaaccttttcacccttatggacatgtacctccaaatgttcaaagtcagc 
ctccacagtattctcagccgatacagcagcagcagctctttccagtgagaccaggtcagcctgtgcatatt 
acatcatcctcacaggctgtatcagttccgtatattcaaacgaacaagattctcacttctggatctactca 
accacagccaaatgcacctccaatgacgggctttgctacatctggacctccattttcttctccatatactt 
ttgtaccatcatcttatectcagcaacaaccaacatccttggtccaaccaaattctcagatgcatgtagct 
ggcgtccctccagcagcaaacacttggcctgttcctgttaatcaaagcacatcacttgtttcccctgtgca 
gcagactgggcaacaaacaccggtcgcagtttccacagacccaggaaacttgactccgcaatctgcatctg 
actggcaggagcatacatctgctgatgggagaaaggctgatgcatccactgtatggaaggaatttacaaca 
cctgaaggaaagaaatattattataacaaggttacaaaggagtctaagtggacaattccggaagatttaaa 
gttagctcgggaacaagcccaactagctagtgaaaaaacgtccctttcggaagctggatctacccctctat 
cccaccatgctgcatcctcgtctgatctagcagttagcactgtgacttctgttgttcccagcacatcttca 
gcacttactggacattcttcaagccctattcaagcgggtttggctgtacctgtcacccgtcctccctctgt 
tgctcctgttactccaacatctggtgcaattagtgacactgaggctactacaatgtactatttttccttgg 
gaagttttgctgagaataaggaaatgtctgtgaatggaaaagccaatttgtcacctgctggtgacaaagca 
aatatcqaaaaacctatqqtatatqct actaaqcaqqagqccaaaqctgctttcaaqtctcttttgqaatc 
tqtaaatqttcattccqactqqacatqqqaacaqacattqaaaqaqattgttcacqataaaaqatatqgtg 
ctttqaqqacactcqqcqaqcqqaaacaagcgtttaacgaqtatcttqqccaaaqqaaaaaaqtqqaaqct 
qaqgaaaqacqaaqqaqqcaqaaqaaagctcgqqaaqaatttqtcaaqatqctagaqqaqtqtqaaqaact 
ttcatcatccctgaaatqqaqcaaaqcaatqaqtttqttcqaaaatqatcaqcqttttaaaqctqttqacc 
qtcctaqqqatcqtqaagatctttttqacaattacattgtqgaacttqaqaqqaagqaaaqaqaaaaqqca 
qcqqagqaacatcqqcagtatatggcaqactatcqqaagtttcttqaaacctqtqactatatcaaaqctgg 
tacacaatqqcgcaaaattcaaqataqactggaqqatgatqacaqa-tqctcatqtcttgaaaaqataqatc 
qtctgattqqttttqaqqaatacattcttqacctaqaqaaqgaaqaaqaaqaqctqaaqaqaqtagagaaa 
qaacatqtaaggcqqqccgaqaqaaaaaaccgtqatgcatttcqtacactattqqaaqaacatgttgctgc 
aqqcatccttacaqccaaqacqtactggttggattattqcattqaqttaaaaqacttqccccaataccaag 
ctqttgcatctaatacatctqgttcaactccgaaaqacttqtttgaaqatgtcacaqaagaattagagaag 

caqtatcatqaqqataaqaqctatqtqaaqqatqctatqaaqtcaaqaaag 

in CDC2bDN-IC57: atttccafcggtctactcgtg

qcaa attttaaatctgctatttcaqaaqatctcagtactcaacaqatatcaqacataaatttaa 

gctgtttgaag in CDC2bDN-IC57

aqcttatatatqatqacttqqttqqgaqagtgaaqqaaaaaqaaqaaaaaqaqqccagaaaqcttcaqcqt 

ctqgctgaaqaatttaccaatctqttqcacactttcaagq aaatcaccgtagcttcaaattgggaagatag 

caaacaactagtagaagaaagtcaagagtacagatcgattggagatgaaagtgttagccaagggttatttg 

aggaatacataacgagtttacaagaaaaggcaaaggagaaggagcgtaagcgtgacgaggaaaaggttaga 

aaagagaaggaaagggacgagaaagagaaacggaaagacaaggataaggagagaagggaaaaggaaagaga 

acgtgaaaaagagaagggaaaagagaggagtaaacgggaagaatcagatggtgagactgctatggatgtga 

gcgaaggtcataaagacgagaaaagaaagggaaaagatcgtgacagaaaacatcgaagacggcatcacaac 

aattctgatgaagatgtt;agttctgataggga.tgacagagatg;agtcgaagaaatcatcccgtaaacatgg 

taatgatcgcaaaaaatcaagaaagcacgcaaactcgcctgaatcggagagtgaaaacc^^ 

agaaaaaagagagtagtcgccgaagtggtaatgacgagctagaggatggagaa^ 

aattcgacattaatctgaaacctt 
CCP molecule: CCP6 amino acid sequence (CDC2bDN-IC57):

;MANNBRQSS^^^^ 

QAVS VP YI QTNKI LT S G S T.QPQENAPBMTG FATS GP PES SB^ 

VAGVPPAANTWPVPVNQSTSLVSPV.QQTGQQTPYAVSTDPGNLTPQSASDWQEHTSADGRKADAS 
TWKEETTPEGKKYYYNKVTKESKWTIPEDLKLAREQAQLASEKTSLSEAGSTPLSHHAASSSDL 
AVSTVTSWPSTSSALTGHSSSPIQAGLAYPVTRPPSVAPVTPTSGAISDTEATTMYYFSLGSFA 
ENKEMSVNGKANLSPAGDKANVEEPMVYATKQEAKAAFK^ 

YGALRTLGERKQAFNEYLGQRKKVEi^ERRRRQKKAREEFVKMLEECEELSSSLKWSKAMSLFEN 
DQRFKAVDRPRDREDLFDNYIVELERKEREKAAEEHRQYMADYRKFLETCDYIKAGTQWRKIQDR 
LEDDDRCSCLEKIDRLIGFEEYILDIjEKEEEELKRVEKEHVRRAERKNRDAFRTLLEEHVAAGIL 
TAKTYWL DYC I E-LKDLPQYQ A VASNTS GS T PKDL F-E DVTEELEKQ YHE DKS YVKDAMKS RK 

AN FKSAISEDLSTQQISDINLKLIYDDLVGRVKEKEEKEARKLQRLAEEFTNLLHT 

ISMVSSWLFED in CDC2bDN-IC57

FKEITVASNWEDSKQLVEESQEYRSIGDESVSQGLFEEYITSLQEKAKEKERKRDEEKVRKEKER 
DEKEKRKDKDKERREKEREREKEKGKERSKREESDGETAMDVSEGHKDEKRKGKDRDRKHRRRHH 
NN S DE D VS S DRD DRDE S KKS S RKHGN DRKKS RKHANS PE S E S ENRHKRQKKE S S RRS GN DE LE D G 
EVGE 
A.

CCP molecule: CCP7/CCP8 nucleotide sequence (CDC2bDN-IC62/E2F3ca55): 

tgaaacctagatttctgcaactgaattcctaattcgaaaaagag^ 

tagcaaggaagacatgggaactagagaacagcattctaacagtagactcacctga ttcaacctcc 

tgatcctcactactttaaacgagtcaagatctcagcgctcgctcttcttaagatg gtggttcacg 
ctcgctctacftggtacaattgaaataatgggtcttatgcaaggtaagac caatggtgatactatc 
attgttatQQatgcttttgctttaccagtggaaggtactgagacaagggttaatgctcaggatga 
tcrcttatcracrtaca tagttgagtattcacagaccaacaagctcgcggggc -aactggagaatgtt 

in CDC2bDN-IC62: c 

- in E2F3ca55 

Qttggatggtatcactctcaccctggatatggatgctggctctccggtatt aatgtttctacgca 
craggcttaaccaacagcatcaggagccatttttagctgttgttattgatcc cacaaggactgttt 
ca gctggta a ggt tga gattggtgctttcagaacatactctaaaggatataa g- -cctccagatg 

" in CDC2bDN-IC62: age 

in E2F3ca55: g — 

aacctgtttctgagtatcaaa - ctattcctttaaat aagattgaggactttggtgttcactgcaa 

a in CDC2bDN-IC62 
in E2F3ca55 

acagtactattcattagatgtcacttatttcaagtcatctcttgattctcaccttctggatctac 
tatggaacaagtactgggtgaacactctttcttcttctccactgctgggtaatggagactatgtt 
gctggacaaatatcagacttagctgagaagcttgagcaagccgagagtcatctggttcagtctcg 
ctttggaggagttgtgccatcatcccttcataagaaaaaagaagatgagtctcaactaactaaga 

gr in E2F3ca55 

taactcgggatagcgcaaagataactgtggaacaggtccatggactaatgtcgcaggtcataaaa 
qatgaattattcaact caa tgcgtcagtccaacaacaaatctcccactgactcgtcggatccaga 

ccctatgattacatadE^grl^^ 

in E2F3ca55: gr 

ttgtcctttcatttattttctgttgtgtaaagaattataatgctaatcagaataatacagaagaa 
gattttggttaaaaaaaaaaaaaaaaaaa 

B. 

CCP molecule: CCP7/CCP8 amino acid sequence (CDC2bDN-IC62/E2F3ca55): 

jv/fp cqqqttsr pttwrt .F.NS T T .T VDSPDS TSDNI FYYDDTSOTRF QQEKPWENDPHYFKRVK I SAIjAJjLKMV 
VHARSGGTIE IMGLMQGKTDGDTI I VMDAFALPVEGTE TRVNAQDDA YE YMVE YSQTNKLAG RLENVVGW 
YHSHPGYGCWLSGIDVS TQTLNQQHQEPFLAWIDPTR TVSAGKVEIGAFRTYSKGYKP P DE PVSEYQTI 
PLNKIEDFGVHCKQ YYSLDVTYFKSSLDSHLLDLL WNKYWVNTL SSS PLLGN GDYVAGQI SDLAEKLEQA 
r c HLVQSRFGGWPBSLHKKKEDESQL TKI TRDSAKI TVEQVHGLMS QVIKDELFNSMRQSNNKSPTDSS » 

1PMTTV 



esh: 

DPDPMITY 
A.

CCP molecule: CCP9 nucleotide sequence (CDC2bDN-IC9): 

ggcacgagtctctctctctctggagcgttctcttctctcctt:gagcttctcttaccgccattaga 

gctccttcacaaactcataaacctatttgttgagccaggcttggcttaaccactggcctttttcc 

a ga c t a a a 1 1 g^gta t tgc fccfcfcc t tcgatgcatccaaatgcaaacaaagaaaatatctctactt 

cagatgtacaggagagttttgtacgaataacgagatcacgagctaaaaaagccatgggaagagga 

gtatcaatacctccaacaaaaccttcttttaaacagcaaaagagacgtgcagtacttaaggatgt 

gagtaatacctctgcagatat-tatttattcagaacttcgaaagggaggcaacatcaaggcaaaca 

gaaaatgtctaaaagagcct:aaaaaagcagcaaaggaaggtgctaacagtgccatggatat"tctg 

gtagatatgcatacagaaaaatcaaaattagcagaagatttgtccaagatcaggatggctgaagc 

ccaagatgtctctctttcaaactttaaagatgaagaaattactgagcaacaagaagatggatcag 

gtgtcatggagttacttcaagttgtagatattgattccaacgtcgaagatccacagtgttgcagc 

ttgtatgctgctgatatatatgacaacatacatgttgcagagcttcaacaacgacccttggctaa 

ttatatggagcttgtgcagcgagatatcgacccagacatgagaaagattctgattgactggcttg 

tagaagtttctgacgactacaagctggttccagatacgctttaccttacagtgaatcttatcgac 

cggtttctgtccaacagttacattgaaaggcaaagactccagctccttggtgtctcttgcatgct 

tatagcttcaaaatatgaagagctttccgcaccaggggtggaggagttttgcttcattacggcca 

acacatacacaagacgagaagtgctgagcatggagattcaaattctaaattttgtgcactttaga 

ttatcggttcctaccaccaaaacatttctgaggcggttcattaaagcagctcaagcttcgtacaa 

ggtgcctttcattgaactggagtatttagcaaactatctcgccgaattgacactggtggaatata 

gtttcctaaggttcctgccatcactaattgctgcttcagctgttttcctagcccgatggacactc 

gaccaaactgaccatccttggaaccctactctgcaacactacaccagatatgaggtagctgagct 

gaagaacacagttctcgccatggaggacttgcagctcaacaccagtggctgtactctcgctgcca 

cccgtgagaaatacaac caacc aaagtt:taagagcgtggcaaagctgacatctcccaaacgagt:c 

acattalc^££offc^g ^ 

tacctattVtcagacattgg 

agctattgtgtggcgccttaattgtaggccattacttgtcacaccacttagctttaaataaatg.t 
tatggaatttttctaatcgcattgctacaactatttactatcctgcgggattttgtacctaggag 
cacttggaaaacgaatacaaaagtgttaattaatataaatttcactgttcatggcaaaaaaaa 



B. 

CCP molecule: CCP9 amino acid sequence (CDC2bDN-IC9):

MYCS S S MHPNANKENI S TS DVQES FVRI TRSRAKKAMGRGVS I PPTKPS FKQQKRjgggggSEggfr 

SADIIYSELRKGGNIKANRKCLKEPKKAAKEGANSAMDILVDMHTEKSKLAEDLSKIRMAEAQDV 
SLSNFKDEEITEQQEDGSGViyiELLQWDIDSNVEDPQCCSLYAADIYDNIHVAELQQRPLANtl^ 
LVQRDI DPDMKB^I*IDWLVEVS DDYKLVPDTLYLTVNLI DRFLSNSYIERQRLQLLGVSCMLIAS 
!KXEEt*SA^GVEE FCF I T ANT YTRRE VL SME I Q I LNFVH FRLSVPTT KT FLRRF I KAAQAS YKVPF 
IELEYLANYLAELTLVEYS 

VLAMEDLQLNTSGCTLAATREKYNQPKFKSVAKLTSPKRVTLLFSR 
A.

CCP molecule: CCP10 nucleotide sequence (CKSBC001): 

cgacatcttctaagaaaga aac aaagaaagacttcacat-tttaccattatttgctctgagctcag 
taggagagttcaagaaaca gjg gca^ga^^ 

tcgtctgctctgcatctgctaaaaccgcaagccctccagctccagtgctgccaccgacaccagct 
ccagcaccagccccggaaaatgtgaatctcaccgagcttttaagtgtagctggtccgttccacac 
attcctcgactaccttctctcgactggagtcattgagactttccaaaaccaagctaacaacactg 
aggaaggcatcacaatctttgtccctaaagatgatgctttcaaagctcagaagaatcctcctttg 
tcaaatctcacaaaggatcagcttaagcagcttgttctcttccatgctctgcctcattactattc 
gctttcggaattcaagaacttgagccaatctggtccagtgagcacctttgctggtggtcaatact 
ccttgaaattcactgatgtttctggcacggttaggattgattctttatggaccaggactaaagtc 
agcagcagtgttttctccactgaccctgttgcggtttaccaagtgaaccgcgtgcttctacccga 
agcaatctttggtactgatgtccctccaatgcctgct:ccagctcctgctcctat.cgttagtgctc 
cttcggattctccttcagttgctgattctgaaggagcttcttcaccaaagtcctcacacaagaac 
tccggacaaaagctgctacttgcaccaatctccatggttatttccggtttggtggcati^tctt; 
gg^t^^t^tlttgcagattgagttatgtttttaagttacaatgtgaaagattgtattacat 
catttgaattgtcttt-ttgatttttgaaacccattttttattatacat-ttttatcattattattg 
tttgtcattacgattgttgtgaattgaaattgttcctccaaaaaaaaaaaaaaaaaaaa 



B. 

CCP molecule: CCP10 amino acid sequence (CKSBC001): 

MAKMQLSIFIAWALIVCSASAKTASPPAPVLPPTPAPAPAPENVTSfLTELLSVAGPFHTFLDYLL. 
STGVIETFQNQANNTEEGITIFVPKDDAFKAQKNPPLSNLTKDQLKQLVLFHALPHYYSLSEFKN 
LSQSGPVSTFAGGQYSLKFTDVSGTVRIDSLWTRTKVSSSVFSTDPVAVYQVNRVLLPEAIFGTD. 
VPPMPAPAPAPIVSAPSDSPSVADSEGASSPKSSHKNSGQKLLLAPISMVISGLVALFL 
A.

CCP molecule: CCP11 nucleotide sequence (CKSBC011):

cttaaa cta catttatcattacagtct 
a.gaatcgBggctett^^ 

cgtcca.g^gctgatgtcgaaaactcacagaagaaaaatggttacgcaaagaagatcgattgt 

gtgcgtgtgtagcacggtgcaggctttcgaggaggccgaggctgtgtcacagagcgtgc^ggac 

ttgctgct^a^ 

acdc{^cctcaccacccacaataaacqccqcaaqtqcccaB^ 

gctgcggataatgggacgatgtcgttttgttagtatttactttgg 

aataaacgagaacgtacgttgtcgttgtgagtgtgagtactgtattattaatggttctatttgtt 
tttacttgcaagttttcttgttttgaatttgtttttttcatatttgtatatcgattcgtgcatta 
ttgtattatttcaatttgtaataagattatgttacctttgagtggttgtttaaaaaaaaaaaaaa 
aaaa 



B. 

CCP molecule: CCP11 amino acid sequence (CKSBC011): SEQ ID NO:77 

MAISKALIASFLISLLVLQLVQADVENSQKPCNGYAKKIDCGSACVARLQAFEEAEAVSQSVRDLL 
LQVQLCASGYVRKLRQVPVLR 

C.

CCP molecule: CCP11 amino acid sequence (CKSBC011): SEQ ID NO:110

MAI SKALIASLL I SLLVLQLVQADVENSQKKNG YAKKI DCGSACVARCRLSRRPRLCHRACGTCC 
YRCNCVPPGTYGNYDKCQCYASLTTHGGRRKCP 
A.

CCP molecule: CCP12 and CCP13 nucleotide sequence (CKSBC98-7 C-term and
N-term, respectively):

aaaataatcgatggagactctaaaaagaaqaaaaataaqaataagaagaagaqaagccatgaaga 
tacggagatagaaccggagcaaaagatgagtctcgacggagactcqagggaqqaqaagataaaga 
agaagaggaagaacaagaaccaagaggaggagccagaqcttgtgacgqagaaa acgaaagtccaa 
qaqgaggaaaagggaaatgtagaagagggtagagccactgttagcatagccat aqctggttcaat 
catccacaacactcaatcacttgagctcgccacacgcgtaatc-bctctttctctctatctctccc 
ttcgtttctctqtttttccattcccagataatttaaaqtccccttcttccatt-tctaacatttct 
caqctcgccqgccaaat:tgctcgtgcagctacaatttt.ccqaatcgacqaqatcg-tagtgttcga 
caa-taagagcagctcagaaatcgaatcagctgctacgaatgcttctgatagcaatgaaagtggtg 
cctcctttctcgttcgtatcttqaagtatctagagacaccacaatatttgaggaaatctctcttc 
cccaagcaaaatgatcttagatatgtgggtatgttgcc gggtatgttgccacctcttgatgctcc 
tcaccatctgcgtaagcacgagtgggaacaataccgtgaagnnnnnattgttccaccctctaagc 
caaa aaaaaaagcaaaaatatattaaaqatacaaaatacaatatacatc acaattaa 

- in CKSBC98-7 C-term 

attcaatattcaaaa aataccc tttcaaqgataattacaattatt taattqatacctcaaagcac 
ggcctqgtaattaqttcatctgaqctgaaaataccaacatttaagcacc tattqattqcatttgg 
tqaacttqctgqgcttqaaqaaagtattqaagataataatcaqtataaq gqqaaaaacqttcqaq 
atatatttaatatatacttqaatacttqtccacatcaaaataq ccaaaccattcaaacaaagqaa 
gcaatqtttatatcacttcaqtacttccaqgaaccc atcaqca gqgcagtqaqaaqac r.T:[^QC 
ttea^t^^acm^caaaaaajyctattttattctcataaatctaaqqtttatcta aaaaagaqt 
gatgtaa tgtaac tgttttagaaaaaaaaaaaaaaaaaa 



B. 

CCP molecule: CCP12 and CCP13 amino acid sequence (CKSBC98-7 C-term and 
N-term, respectively): 

MGKKNKRSQDESELELEPELTKIIDGDSKKKKNKNKKKRSHEDTEIEPEQKMSLDGDSREEKIKK 
KRKNKNQEEEPELVTEKTKVQEEEKGNVEEGRATVS IAIAGSIIHNTQSLELATRVISLSLYLSL 
RFSVFPFPDNLKSPSSISNISQLAGQIARAATIFRIDEIWFDNKSSSEIESAATNASDSNESGA 
SFLVRILKYLETPQYLRKSLFPKQNDLRYVGML PGMLPPIsDAPHHLRKHEVIEQYREXXIVPPSKP 
REEAGMYW GYKVRYASOLSSVF KECPFEGGYDYLIGTSEHGLVISSSELKIPTFR HLLIAFGGLA 
A.

CCP molecule: CCP14 nucleotide sequence (CKSBC103-19):

cctcclbctgtttc^ 

gtaaacacatatactcagctgcttctggagctttcctctcttatctctcctttggcttctcctca 
aatcttcacttccttgtcccaatgacgattggttacgcttcaatggcgatttatcgacccttgtc 
tggattcattactttcttcctaggcttcgcttatctcattggctgtcatgtgttttatatgagtg 
gtgatgcttggaaagaaggaggaattgattctactggagctttgatggtattaacactgaaagtg 
atttcgtgttcgataaactacaacgatggaatgttgaaagaagaaggtctacgtgaggctcagaa 
gaagaaccgtttgattcagatgccttctcttattgagtactttggttattgcctctgttgtggaa 
gccatttcgctggcccggttttcgaaatgaaagattatctcgaatggactgaagagaaaggaatt 
tgggctgtttctgaaaaaggaaagagaccatcgccttatggagcaatgattcgagctgtgtttca 
agctgcgatttgtatggctctctatctctatttagtacctcagtttccgttaaGtcggttcactg 
aaccagtgtaccaagaatggggattcttgaagagatttggttaccaatacatggcgggtttcacg 
gctcgttggaagtattactttatatggtctatctcagaggcttctattattatctctggtttggg 
tttcagtggttggactgatgaaactcagacaaaggctaaatgggaccgcgctaagaatgtcgata 
ttttgggggttgagcttgccaagagtgcggttcagattccgcttttctggaacatacaagtcagc 
acatggctccgtcactacgtatatgagagaattgtgaagcccgggaagaaagcgggtttcttcca 
attqctaqctacqcaaacc gtcagtgctgtctggcatggactgtatcctgga'bacattatattct 
ttgtgcaatcagcattgatgatcgatggttcgaaagctatttaccggtggcaacaagcaatiacct 
ccgaaaatggcaatgctgagaaatgttttggttctcatcaatttcctctacacagtagtggt-bct 
caattactcatccgtcggtttcatggttttaagcttqcacgaaacactagtcgccttcaagagtg 
tatattacattggaacagttatacctatcgctgtgcttcttctcagctacttagttcctgtgaag 

cctgttagaccaaaaacfibacE^ 

gttcttttctttttttccacttggnccgttttatgtaaaacaagagaaatcaagatttqaggttt 
tattcttaaaaaaaaaaaaaaaaaa 



B. 

CCP molecule: CCP14 amino acid sequence (CKSBC103-19): 

MELLDMNSMAASIGVSVAVLRFLLCFVATIPISFLWRFIPSRLGKHIYSAASGAFLSYLSFGFSS 
NLHFLVPMTIGYASI^IYRPLSGFITFFLGFAYLIGCHVFYMSGDAWKEGGIDSTGALMVIiTLKV 
ISCSINYNDGMLKEEGLREAQKKNRLIQMPSLIEYFGYCLCCGSHFAGPVFEMKDYLEWTEEKGI 
WAVSEKGKRPSPYGAMIRAVFQAAICMALYLYLVPQFPLTRFTEPVYQEWGFLKRFGYQYMAGFT 
ARWKYYFIWS I SEAS I IISGLGFSGWTDETQTKAKWDRAKNVDILGVELAKSAVQI PLFWNIQVS 
TWLRH YVYERI VKPGKKAGFFQLLATQT VSAVWHGL YPG YI I FFVQS ALMI DGSKAI YRWQQAI P 
PKMAMLRt^VLINFLYTVWLNYSS 

PVRPKTRKEE *" * ' 
A.

CCP molecule: CCP15 nucleotide sequence (CKSBC199-20): 

ttatataacctatctacactttgatctccgacaat tca ctttccc^taagaaccaactgagaga 
gagagagcgccggagaagaagaattttagagagcgg^jgaf^^ 

atggatgctttcgagaagcttgagaaagttggtgaagggacatacgggaaagtttacagagccag 
agagaaagctaccgggaaaatcgtcgctctaaagaagacgcgtctccatgaggacgaagaaggcg 
ttccttccaccactctccgcgagatctccattttgcgaatgctcgctcgtgatcctcacgtcgtc 
aggttaatggatgttaagcaaggactaagcaaagaaggcaaaactgtactgtacctagttt±tga 
atacatgqacactgatqtcaagaaattcatcagaagtttccqtagcactggcaaq aacattccaa 
cccaaactatcaagagcttgatg-batcaactatgcaaaqgtatggcattctgccatggtcacggg 
atattgcacagagatctcaagcctcacaatctcttqatggatcccaagacaatgagq ctcaaaat 
agcagatcttgg-bttagccagagccttcactctgccaatgaagaaqtataccc atqaqatattaa 
ctctttggtatagagctccaga ggtt- cttcttggtgccacccattactctacagctg 

in CKSBC199.20: ngntt 

tqqatatgtgg t ctgttggctgcatatt'bgctgaacttqtgaccaaccaagcaatcttt 

n in CKSBC199.20 

caqq qaqactctgagctccaacaqctcctccatattttcaagttgtt t qqqacacccaa 

in CKSBC199.20: Z 

tqaagaaatgtggccaggagtgagcacactcaagaactggcatqaatacccacagtgg aaaccat 
cqactctatc c tctgctgttccaaacctcqacgaggctggagttgatcttcta 

- in CKSBC199.20 

tctaaaatgctgcagtacqagccagcgaaacgaat c tcagcaaaqatqgctatggagca 

a in CKSBC199.20

tccttactttc^tg^ 

tt-bccaagttttatgqtttttctagttttgcttctttcaaqcatatctctagtgtqctgcttccc 
cctctatgaa 



B. 

CCP molecule: CCP15 amino acid sequence (CKSBC199-20):

MnTr.rzVTAV.qAMnAFF.KT.RKV je^ 

RMIjARDPHVVRLMDVKOGLSKEGKTVLYL VFEYMDTDVKKFIRS frstgkniptqtikslmyqlc 
KG^FCHGH GILHTOSK^^ 

— ~~ " ' "~ * * ★ * * ****** 

ATHYSTAVDMWSVGCIFAELVTNQAIFQGDSELQQLLHIFKLFGTPNEEMWPGVSTLKNWHEYPQ 
*** ** ** * ★****++ + ++++ ++ ++ +++ + ++++ + 


WKPSTLSSAVPNLDEAGVDLLSKMLQYEPAKRISAKMAMEHPYFDDLPEKSSL - 
1 11 1 1 1111 1 1 11 1 1 1 HI 1 
A.

CCP molecule: CCP16 nucleotide sequence (E2F5BBC1): 

tagtcaacgatggatttga'gacatgaacaactaat^gatttigatttcgtg 
attggtaaattgtgtagaga^ 

aggcaacatcct.tcagtgagcgttgagaaaactccagtgagaaggaaattgattgttgatgatga 
ttctgaaattggatcagagaagaaagggcaatcaagaacttctggaggcgggcttcgtcaattca 
gtgttatggtttgtcagaagttggaagccaagaagataactacttaGaaggaggttgcagacgaa 
attatttcagattttgccacaattaagcaaaacgcagagaagcctttgaatgaaaatgagtacaa 
tgagaagaacataaggcggagagtctacgatgcgctcaatgtgttca1:ggcgttggatattattg 
caagggataaaaaggaaatccggtggaaaggacttcctattacctgcaaaaaggatgtggaagaa 
gtcaagatggatcgtaataaagttatgagcagtgtgcaaaagaaggctgctt-ttcttaaagagtt 
gagagaaaaggtctcaagtcttgagagtctt.atgtcgagaaatcaagagatggttgtgaagactc 
aaggcccagcagaaggatttaGcttaccattcattctacttgagacaaacGCtcacgcagtagtc 
gaaatcgagatttctgaagatatgcaacttgtacacctcgacttcaatagcacacctttctcggt 
ccatgatgatgcttacattttgaaactgatgcaagaacagaagcaa 

in E2F5BBC1: g 

gaacagaacagagtatcttcttcttcatctacacatcaccaatctcaacatagctccg^tcattc 
ttcatcc agtt cttgcattgcttctggaacctcaggcccggtttgctggaactcgg^tecat^g 
atactc^cjjgccgagcttctattcccaaattcttcaagaagaagaagtaatgatctaattggta 
tactaaaaaattatacatctggtttagtgttcaattgagagagactgl-aaaatcaattcataggc 
caacaaatgtttgtttatccaattttcctttttattcgaacttgatgcgatatttcaacggaaac 
agaaactattgttttaaaccaaaaaaaaaaaaaaaaaa 



B. 

CCP molecule: CCP16 amino acid sequence (E2F5BBC1): 

MSMEMELFVTPEKQRQHPSVSVEKTPVRRKLIVDDDSEIGSEKKGQSRTS 



GGGLRQFSVMVCQKL 



EAKKITT YKE VADE 1 1 SDF 



_J AT 1 KQNAEKPLNENE YNEKNJRRK^^ 
WKGL^ITCKKDVEEVKMDRNKVMS SVQKKAAFIiKE'L REB^S S LE SLMSRNQEMWKTQGPAEG FT 
LPFI IiT.iETNPHAWE IE 1 SEDMOLVHUD FNS TPFS VHDDAY 1 LKLM QEQKQE QNRVS S S SSTHHQ 
SQHSSAHSSSSSCIASGTSGPVCWNSGSIDTR 
A.

CCP molecule: CCP17 nucleotide sequence (FL67BC4-2): 

caaattctctggaagaagaagaagacgaaggEcaaccgacagagacgtcgcagccggcgccgtc 
ggatcaaggccgccggcttaaggatcagttatcggagagtatgagcttcagtagccaaatgaaga 
aggaagacgatgagttgtcgatgaaagctttgtcggcgttcaaggccaaagaagaggagatcgag 
aagaagaagatggagatcagagaaagagttcaagctcagcttggtcgtgttgaagatgagtccaa 
gcgtctcgctatgattcgcgaggaacttgaaggttttgctgatcccatgaggaaggaagttacta 
tggtgaggaagaagattgattctctcgacaaagaattaaagccattggggaatacagttcagaaa 
aaggaaacagagtacaaggatgctcttgaagcattcaatgaaaagaacaaggagaaggtggagct 
gatcaccaagctacaggagttggaggga gaaa gcgagaaa.ttcagg1:tcaagaagctggaggagc 
taagcaagaacattgatctaaccaa^ct^ 

ttcaaagttctaaaaaagtcactttttagagtattttcattgttcttttatgattctagtaatat 
atataatttataaaataaaaagtaagaagatatgtgtttgaactagatgttgcaaagaaaatgta 
acaaagttacgatggcactacattatcgacgtgattggcagaattgtaatagtaatgtaaagaaa 
ctatgtttgttccggaaaaaaaaaaaaaaaaaaaaaaaaaa 



B. 

CCP molecule: CCP17 amino acid sequence (FL67BC4-2):

MQPTETSQPAPSDQGRRLKDQLSESMSFSSQMKKEDDELSMKALSAFKAKEEEIEKKKMEIRERV 
QAQLGRVEDESKRIJ^IREELEGFADPiyiRKEVTMVRKKIDSLDKELKPLGNTVQKKETEYKDALE 
AFNEKNKEKVELI TKLQELEGESEKFRFKKLEELSKNI DLTKP 
A.

CCP molecule: CCP18 nucleotide sequence (FL67BC12-17):

aagaaagaagaaggcteg£:fcGaoaa;gaccact^^ 

agagagttg.gagtcaattccattcccgccattgaagaagttaacatttttaaggatgatgtagtc 
attcagttcattaaccctaaagttcaagcttcaattgctgctaacacatgggttgtgagtggtac 
acGacagacgaaaaaattgcaagacattcttcctcagattatcagccaacttggaccagataact 
taaaGaacctqaaqaaqctiaqcaqagcaatt ccagaaacaagctcGaggtgcaggtgatgtccca 
gcaacaatccaagaagaggacgatgatgatgatgtcccaga ^c ttgtagtqqqagagactttcga 
gacccctgctactgaagaggctGCcaaagc^ 

agagctcacctgGaaaacccatcataaaaatgtttgtcgctcgacctcttctgagcactgtcaga 
ttcttgttttctctaatgcttgcgaacagaaagacttggttttattatcacttgatgctttttgg 
tccgaacagcaattttcc-btttattaaggttagatcqctttttgtttaccacctgttcaaatgag 
tactactatgtcctgtcgcttcatacacttcttgcaacacagtcctttgttttgagtcaaaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaa 



B. 

CCP molecule: CCP18 amino acid sequence (FL67BC12-17): 

MNREKLMKMANTVRTGGKGTVRRKKKAVHKTTTTDDKRLQSTLKRVGVNS I PAI EEVNI FKDDW 
IOFINPKVQASIAANTWVVSGTPdTKKLODILPQIISQLGPDNLDNLKKLAEQ FQKQAPGAGDVP 
AT I QEE DDDDDVPDLWGET FET PATEE APKAAAS 
A.

CCP molecule: CCP19 nucleotide sequence (JUT1):

tatccqatqaccttatcccc^cgccggtgagcgaa tct cagatccaaaattttgcaaaatcctca 
gatcgtcttaccttctccgaatcgatcgatttttcfj^ 

ccatctccgggagattcttccctttcaccacaagctcctccttctccgccgattttgccaacaaa 
cgacgtgacggtggccgtcgtgaagaaaccacaaccggggctttcttctcaatctccgtccatga 
acgctttagcgttagtggttcatactccttctgtaaccggtggtggtggtagcggaaacagaaac 
ggacgaggaggaggaggaggaagcggtggtggtggaggaggaagagatgattgttggagcgaaga 
agctacaaaggttctaatcgaagcttggggagatcgattctctgaaccaggtaaaggaactttga 
agcaacaacattggaaagaagtagctgagattgtgaacaagagtcgtcaatgcaaataccctaaa 
actgatattcagtgtaagaacagaattgatacggtgaagaagaagtataagcaagagaaagctaa 
gattgcttctggtgatggacctagtaaatgggttttcttcaagaagcttgagagtttgattggtg 
gtactacaacattcattgcttcttcaaaagcttcagagaaggctcctatgggaggagctcttggg 
aatagccgttcgagtatgtttaaacggcaaactaaaggtaatcagattgtgcagcaacaacaaga 
gaagagaggctctgattcgatgcggtggcattttaggaaacgtagtgcttctgagactgagtctg 
agtctgatcctgaacctgaggcttctcctgaggaatctgctgagagtctcccacctttgcaaccg 
attcaaccgctttcgtttcatatgccaaagcggttgaaggtggataagagtggaggtggagggag 
tggagttggagatgtggcgagggcgatacttggatttacggaagcttatgagaaggcggaaactg 
ctaagcttaagttaatggcggaactggaaaaggagaggatgaaatttgctaaagagatggagttg 
cagagaatgcagttcttgaaaactcaattggagataacacagaacaatcaagaagaggaagagag 
gagcaggcagcgag gaga aaggaggatcgttgatgatgatga-tgatcgcaatggcaagaataacg 

ttttaggttttgatgttgttgttgttgttattgttac^ 



B. 

CCP molecule: CCP19 amino acid sequence (JUT1): 

MEDDDEIQSIPSPGDSSLSPQAPPSPPILPTNDVTVAWKKPQPGLSSQSPSMNALALWHTPSV 
TGGGGSGNRNGRGGGGGSGGGGGGRDDCWSEEATKVXIEAWGDRFSEPGKGTLKQQHWKEVAEIV 
NKSRQCKYPKTDIQCKNRIDTVKKKYKQEKAKIASGDGPSKWVFFKKLESLIGGTTTFIASSKAS 
EKAPMGGALGNSRSSMFKRQTKGNQIVQQQQEKRGSDSMRWHFRKRSASETESESDPEPEASPEE 
SAESLPPLQPIQPLSFHMPKRLKVDKSGGGGSGVGDVARAILGFTEAYEKAETAKLKLMAELEKE 
RMKFAKEMELQRMQFLKTQLEITQNNQEEEERSRQRGERRIVDDDDDRNGKNNGNVSS 
CCP molecule: CCP20 and CCP21 nucleotide sequence (JUT2 and JUT3,
respectively):

aagctttactactt^ 
caaccctaatcccqattccacgto^ 

gcgtagacgaacccgactctctcgattccttct c ctccatgagt'Ctt: 

in JUT2 (N-term) : n 
aactccgacgaacctaatcagacttctaatcaatcgcctctttctccccctacgcccaatttacc 
ggtgatgcctcctccgtccgtgctticatctttcctttaaccaagatcatgctt: 
in JUT2 (N-term) : t t

gcttcgc - tgt c ggcactgaccgtggcttc ~cg gat c cttaatt:gcgatccct'ttcgcg 

c n a n n in JUT2 (N-term) 

agattttccggcg t gatt tcgatcgtggcggtggtgttgcagtcgtggagatgcttttc 

g in JUT2 (N-term) 
agatgcaatatattagccctagttggtggcggacctgatcctcaatatcctcctaataaggttat 
gatttgggatgatcaccagggccgatgtatcggagaactctctttcaggtccgatgttagatccg 
tccggcttaggagggatcggattattgtcgttcttgagcagaagatttttgtctacaatttctct 
gacctcaagctgatgcatcagattgaaaccattgccaaccctaagggtttgtgtgctgtttctca 
gggtgttggttctatggttttggtatgtccaggtttgcagaaaggtcaagttcggatcgagcact 
acgcttctaaacggaccaaattc gtca tg 

in JUT3 : -

gctcatgattccagaatagcttgcttcgctctcacgcaggatggccatttgttggccactgctag 
ctctaagggtactctggttcggatcttcaatactgttgatggtaccttgcgtcaagag tctggca 

in JUT3 :

cttctgaggatgaaatag gtaaggaggg - tgcggatagagcagagat 
g in JUT3

ctacagtttggccttctcttcaaatgctcagtggttagctgtctcaagtgacaaaggaacggtcc 
atgtctttggtctcaaagtcaactccgga-tctcaagtgaaagactcatcccgaattgcacctgat 

gctactccctcatccccatcgtcgtctctgtctttattcaa agt 

in JUT3 : agg 

gttaccgaggtatttcagctcggagtggtcggtggctcagttcaggttggttgaaggaactcagt 
acatagccgcctttggccatcaaaagaacaccgttgttattcttggcatggatgggagcttctac 
agatgccagtttgatccggtgaacggcggtgaaatgtctcagcttgagtaccacaactgtctgaa 
accgccttoagrttttc [ 

tctctgcaacttaagcggtgagatatggtgtatagttttgtgtatataataatgatgggtcgtcc 
tataatttgtaaaacct:tttatcgct:acccgggtcgactct:agagccct:atagtgagtcgtatta 
ctgcagagatctatgaatcgtagatactgaaaaa ; - 
CCP molecule: CCP20 and CCP21 amino acid sequence (JUT2 and JUT3,
respectively):

MATVSSSSWPNPNPNPDSTSASDSDSTFPSHRDRVDEPDSLDSFSSMSLNSDEPNQTSNQSPLSP 
PTPNLPVMPPPSVLHLSFNQDHACFAV GTDRGFRILNCDPFRE I FRRDFDRGGGVAWEMLFRCN 
ILALVGGGPDPQYPPNKVMIWDDHQGRCI GELS FRSDVRSVRLRRDRI I WLEQKIFVYNFSDLK 
LMHQIETIANPKGI ^7V yo ^^°^ 7T ^ p ^ Tn ^™ 7PTF - H ^^ 

HLLATASSKGTLVRIFNTVDGTLROE SGTSEDEIGKEGAnRAF,IYSLAFSSNAOWLAVSSDKGTV 

VRR in JUT 3 

HVFGLKVNSGSOVKDSSRIAPDATPSSPSSSLSLFK — VT.PRYFSSEWSVAOFRTjVEGTOYIAAFG 

-— — — - G in JUT 3 

HOKNTWILGMDGSFYRCOFDPVNGGEMSOLEYHNCLKPPSVF 



FIGURE 21 




CCP molecule: CCP22 nucleotide sequence (JUT6):

agagcttcctctctctatatctggctttcti 

tgagaagcctctgaagcttctcactgaagaagacatttctcagcttactcgcgaagattgccgca 
aattcctcaaagagaaaggtttcttcttcttcctttctccatttttttccggtcttattgtcttc 
gacgaatggcggctgacacgtgtcgaaacaggaatgcgcaggccttcgtggaataaatctcaggc 
gatccagGaagttttatctcttaaagctctctatgaacctggagatgattccggcgccggaatcc 
tccgcaagatccttgtttctcagccgccaaatccgcctcgcgttacaacaacgttgattgagcca 
aggaacgagctcgaagcttgtggaaggattcctttacaggaagatgatggtgcgtgccatagaag 
ggattctccaagatcagctgagttttctggtagttctggtcagtttgttgcggataaagatagcc 
acaagactgtttctgtttcccccagaagcccagctgaaacaaatgcggtggttgggcaaatgacg 
atattttatagtggcaaagtgaatgtatatgatggagtaccacctgaaaaggcccggtctatcat 
gcattttgcagccaatccaattgatttgcctgaaaatggtatttttgcttctagtagaatgattt 
cgaaacccatgagtaaagagaagatggtggagcttccccaatatggacttgaaaaggcacctgct 
tctcgtgattctgatgttgagggtcaggcgaacagaaaagtatcgttgcaaagatatcttgaaaa 
gcggaaagacagattttctaagaccaagaaggctccaggagttgcgtcctctagcttggagatgt 
ttctgaatcgtcagccacggatgaacgctgcatattcacaaaaccttagtggcacagggcattgc 
gagtcacctgaaaatcaaacaaaaagtcccaatatctcagttgatctaaacagtgatctaaacag 
cgaaggtgccaaaagaactggagatggtactacgggtcaaaaggcgggaagaacaatttcatgtt 
cttataacatgactaagacatcacgaggaacacgatgggtgaagcggtcaagagaagaagtgatt 
caagcttggtatatggatgatagtgaagaggatcagagacttcctcaccacaaggatcctaaaga 
gtttgtatcgttggacaaacttgcagagctgggagtacttagctggagacttgatgctgataact 
ataaaaccaataaaaatttaaaaaaaatccataaatctcgtqqttactcttac atggacttttgt 
gaggtatgcccggaaaagcttccaaactatgaagtgaaagtgaagagctttttcgaagaacattt 
acacactgatgaggagatccgttactgcgttgcaggaactggttactttgatgtgagagatcgta 
atgaagcttggattagggtattggtaaagaagggaggtatgatagtcttacctqctgggatctat 
catcgcttcactgtggactctgacaactatatcaaggcaatgcggctattcgtgggtgaaccggt 
atggacaccatacaatcgcccacacgaccatcttcctgcaaggaaagaatatgtcgataacttca 
tgjatoaat^ccteggctHBB^ 

aaacaaggcctacaataaagaaagctttcctgtcaagtattggatatttatatgtattcctgtgt 
agaatgatggcttttggtatgcttgagttgttgtaaacttagttacactctctgatatgtctctc 
tttaccatctttgtcgtatcccatatacgaaaagattacattgggattcatattgtcttacgttic 
gttcctatgtgcaatatgttgagtttt 



FIGURE 22 






CCP molecule: CCP22 amino acid sequence (JUT6): 

MDVGVTTAKSILEKPLKLLTEEDISQLTREDCRKFLKEKGFFFFLSPFFSGLIVFDEWRLTRVET 
GMRRPSWNKSQAIQQVLSLKALYEPGDDSGAGILRKILVSQPPNPPRVTTTLIEPRNELEACGRI 

PLQE DDGACHRRD S PRSAEFSGSSGQFVADKDSHKTVSVS PRS PAETNAWGQMTI FYSGKVNVY 
DGVPPEKARSIMHFAANPIDLPENGIFASSRMISKPMSKEKMVELPQYGLEKAPASRDSDVEGQA 

NRKVSLQRYLEKRKDRFS KTKKAPGVAS SSLEMFLNRQ PRMNAAYS QNLS GTGHCE S PENQTKS P 
NISVDLNSDLNSEGAKRTGDGTTGQKAGRTISCSYNMTKTSRGTRWKRSREEVIQAWYMDDSEE 

DQRLPHHKD PKE FVS LDKLAELGVL S WRLDADN YET DE DLKKI RE S RG YS YMDFCE VC PEKLPN Y 
EVKVKSFFEEHLHTDEEIRYCVAGTGYFDVRDRNEAWIRVLVKKGGMIVLPAGIYHRFTVDSDNY 

I KAMRLFVGEPVWT P YNRPHDHL PARKE YVDNFMINAS A 
A.

CCP molecule: CCP23 nucleotide sequence (kbp1):

catcqcttttcqctqaaatcaaaatttctccagtttt-ccgatcagtcqcaaqaaaaccc 

c in KBP1 

g in KBP1 

tttgtccaaaatcttctccagcagatgcaaaccaggttccagacaatgtcggactccat catcac 
aaagattgatgacatggqaggcagaatcaatgagctggagGaaagcatcaatgatctaagagccg 
agatgggagt a gaaggcactcctcctccagccticcaaatcaggcgatgaacccaaaacaccggct: 

tatgttaagagGagtattacttttaaaattcctgttt t aagaaacgagtttgttgtttattaaag 

- in KBP1 

ttcatcaaatagattgatgatgtggtgcattacattattctccacctatgaattgcatttctatt 
ttggtctaaaaaaaaaa 

B. 

CCP molecule: CCP23 amino acid sequence (kbp1): SEQ ID NO:89

TSFPITRKKTL KMDGHDSEDTKQSTADMTAFVQNLLQQMQTRFQTMSDSIITKIDDMGGRINELE 
QS I NDLRAEMGVEGT P P PAS KSGDE PKT PAS S S 



CCP molecule: CCP23 amino acid sequence (kbp1): SEQ ID NO:118

MDGHDSKDTKQSTADMTAFVQNLLQQMQTRFQTMSDSIITKIDDMGGRINELEQSINDLRAEMGV 
EGTPPPASKSGDEPKTPASSS 
CCP molecule: CCP24 nucleotide sequence (kbp3):

agaacaattgagattcttggttgtgttaagg^aaatctacaccatgaaaacgaattttcttgt 

actggctttgtctttgtgtatccttctttcaagcttccatgaggtttcttgtcaggatgatggta 

gtggtttgagtaatttggatctaatagaacgtgattatcaagatagtgtcaatgctcttcaaggc 

aaggacgatgaagatcagtctgcaaagatacagagtgaaaaccagaataacactacagtgactga 

taagaacactatttctctatctctatcagatgaatctgaggttggatctgttagtgatgaaagcg 

ttggacgttcgagtctgttggatcaaatcaaacttgaattcgaagctcatcacaatagtattaac 

caagctggatctgatggtgtcaaggctgaatccaaggatgatgatgaagaattatctgctcatag 

acagaaaatgttggaagaaatcgaacatgagtttgaagctgcttcagatagtctgaaacaactaa 

agactgatgatgtaaacgaaggaaatgatgaagaacattctgcaaagaggcaaagtttgttggaa 

gagatcgaacgtgagtttgaagctgctacaaaagaacttgaacaactaaaggttaatgacttcac 

cggggacaaagatgacgaagaacactctgcaaagagaaaaagtatgcttgaagctattgaacgcg 

agtttgaagctgctatggaaggcattgaagcacttaaggtttctgattccacaggaagcggagat 

gatgaagaacaatctgcaaagagactaagtatgcttgaagagatcgaacgggaatttgaagctgc 

ttcaaaaggtcttgaacaactaagggctagcgattcaaccgcggacaataacgaagaagaacacg 

ctgcaaagggacaaagtttgttagaagagatcgaacgagagttcgaagctgctacagagagcctt 

aagcaacttcaagttgatgattctactgaagacaaagaacactgtaaagcactcttcttcttatt 

atctgctattctttctctatggttatctgaatcaggctttgaatgtattgtagttacagctgcaa 

agaggcaaagtctgctggaagagattgaacgtgaatttgaagctgcaacaaaagatcttaaacaa 

ctaaatgatttcactgaaggcagtgctgatgatgaacaatctgcaaagagaaacaaaatgttgga 

agatatcgaacgcgaatttgaagctgctacaataggtcttgaacaactaaaggctaatgatttct 

ctgaaggcaataataatgaagaacaatctgcaaagagaaagagtatgcttgaagagatcgaacgc 

gagttcgaagctgctattggaggtcttaaacagatcaaagttgatgattccagaaatcttgaaga 

agaatctgctaagagaaagataattttggaagagatggaacgtgaatttgaagaagcacacagtg 

gtattaatgcaaaggctgacaaagaagaatctgcaaagaaacagagtggctctgctataccagag 

gttcttggactaggacagtcaggtggttgtagctgttctaaacaagacgaagattcctcgattgt 

tataccaacaaaatatagcatagaagatatcctctctgaagaatctgcagtccagggaacagaga 

cttctagtctcaccgcgtctttgactcaactcgttgagaatcacaggaaagaaaaggaatctcta 

ctcggacacagagttctcacttctccttctatagcttcttccacaagcgaatcatctgctacatc 

agagactgtagaaaccctaagggctaaactgaatgagcttcgcggcttaaccgctcgtgagcttg 

tgacacgtaaagatttcggtcagattctcat-tacggctgcgagttttgaagagctaagttcagct 

ccaatcagttacatttctaggttagctaaatacagaaacgtcatcaaagaaggacttgaagcttc 

t.aaaaaaqttcacat cqcqcaqqtacgaqcaaaaatqctcaaa qaaqttqccacqqaqaagcaaa 

ccqccgtqqacactcatttcqcaaccqctaaaaaqcttqctcaaqaaqgaqacqcqttgttcgt t 

aaaatcttcqcaatcaaqaaactgttqqcqaaacttqaaqcaq aqaaaqaatctqttqatggaaa 

qtttaaqqaqactqtqaaaqaactttctcatcttctqqctqatqcttctgaq qcftacqaaqagt 

atca tqqcqcqqtqaqqaaqqcqaaagacqaqcaagc g qctqaqqaatttqcqaaagaggcgacg 

caaaqtqcagaga1:catttq.qqttaaqtttcttaqttct C 1:lgjaqaacaa^q aqattct^ 

ttqtqttaaqaqcaaatctaqaqctcttqttqqttcttqttatqta ttttqtqatqatqttctgl: 

ttcaqaqtttqtqtqttgqttgtatcaqqaqaaaqaqqctggq aqatagagaqaaaqagagtctc 

tqcqaaaactaataatqttttttcaqatatctaaataataaqc tttttacaaaaaaaaaaaaaaa 

aaaaaaaaaa 
CCP molecule: CCP24 amino acid sequence (kbp3):

IME'IT^^MKl/N'EiQ VLAIi'S. I> CI LBS S^EHE VS CQ D BG S Gli S NLDL I ER0 YQ D S^TNALQGKDDE DQ S AKI 
QSENQNNTTVTDK^ 
;SKDDDEE£SMi^ 
KELEQLKWDFTGDKDDEEHS^^ 

MLEEIEREFEAASKGLEQLRASDSTADNNEEEHAAKGQSLLEEIEREFEAATESLKQLQVDDSTE 

DKEHCKALFFLLSAILSLWLSESGFECIWTAAKRQSLLEEIEREFEAATKDLKQLNDFTEGSAD 

.DEQSAKRNKMLEDIEREFEAATIGLE.QLPCANDFSEGNNNEEQSAKRKSMLEEIERE 

QIKVDDSRNLEEESAKRKIILEEMEREFEEAHSGINAKADKEESAKKQSGSAIPEVLGLGQSGGC 

SCSKQDEDSSIVIPTKYSIEDILSEESAVQGTETSSLTASLTQLVENHRKEKESLLGHRVLTSPS 

IASSTSESSATSETVETLRAKLNELRGLTARELVTRKDFGQILITAASFEELSSAPISYISRLAK 

YRNVIKEGLEASERVHI AQVRAKMLKEVATEKQTAVDTHFATAKKLAQEGDALFVKIFAIKKLIA 

KLEAEKESVDGKFKETVKELSHLLADASEAYEEYHGAVRKAKDEQAAEEFAKEATQSAEIIWVKF 

LSSL 
CCP molecule: CCP25 nucleotide sequence (kbp6):

aatttgaatccaatccccaaattatctcat gg^^ 

atcttttgttatcttcgtcattctcatgcttctcttcacctggctttctcgcaaatctggaaatg 

ctcccatttattacccgaatcggatccttaaagggctggagccatgggaaggcacctccttgact 

cgaaacccttttgcttggatgcgtgaagctttgacttcctctgaacaagatgtcgttaacttatc 

cggcgtcgatactgctgtccactttgtcttcttgagcactgttctggcrqatatttqcttgttcca 

qtcttcttctcctaccaactctactgcctctagccgctacagacaacaacataaagaacacaaag 

aatgcgacagataccacaagcaaaggaacttttagccaacttgataatctatcaatggctaacat 

cacaaaaaaaagttcgaggctgtggqcgttcctaggagctgtttactggatatcttt ggtcacat 

atttcttcttgtggaaagcttataagcatgtctcttcattgagagctcaagctctgatgtctgct 

qatgtaaaacccgagcaattcgctattcttqttagggatatgcctqcaccacctgacgg gcagac 

acagaaagagtttattgattcttatttcagagaaatataccctgagacattctacagatcgcttg 

tcgcaacagaaaacagcaaggttaataaaatatgggaaaaattggaaggttacaagaagaagctt 

gcgcgagcagaagcaatattagcagcaactaataaccgtcccacgaacaaaaccggcttctgtgg 

gctagtcggtaaacaagtagacagcattgagtattacactgagctaatcaacgagtctgtagcca 

aactggaaacagagcagaaagcggttcttgctgagaagcagcaaaccgcagcagtggttttcttc 

acaaccagggttgctgctgcatcagcagctcagtctctgcactgccagatggttgataaatggac 

tgtgaccgaagctcctgagccacggcagctcctatggcagaatctcaacatcaagctcttcagca 

gaataatccggcaatacttcatctacttctttgttgcagtgaccattctgttttacatgatacca 

atcgcgttcgtctctgccatcaccactcttaagaatcttcagaggattattccgttcataaagcc 

ggttgtggagataaccgccataagaaccgttttggagtctttccttcctcagattgcgctcattg 

ttttcttggccatgttgccgaagcttctcttgtttctctccaaagccgaggggattccttcacag 

agccatgccattagggctgcttcagggaagtacttttacttctcggtctttaatgtcttcattgg 

tgttacccttgctgggactttgttcaacacagtgaaggatatcgcgaaaaatcccaaactcgaca 

tgattattaaccttttggctactagcctccctaagagcgcaactttcttcctgacctacgttgct 

ctcaagttctttatcggttatggccttgagctgtctcggatcatacctttgataatcttccacct 

gaaaaagaagtatctctgcaaaaccgaagcggaggtcaaagaagcttggtacccgggagacttaa 

gctatgcgactagggttcccggagacatgctcatcctcacaatcacgttctgctattcagtcatt 

gctcctcttatcctcatattcggcatcacctactttggtttaggctggctagtcctcaggaatca 

ggcgttgaaagtgtacgttccatcatacgagagctatggaagaatgtggccgcatattcaccagc 

gcatactagcagcgttgtttctattccaagtggtaatgtttggctacttaggagccaagacattc 

ttctacacggcccttgtgatccctctcattatcacctctctcatcttcggatatgtgtgccgcca 

gaaattctacggagggttcgaacacacagctctcgaggtagcttgccgtgagctgaagcagagtc 

cagacctagaggagattttcagagcatacattccgcatagcttgagctctcacaaaccagaagaa 

c^acgagttcaaaggcgcaatgtctcgttatcaagatttcaacgcaatagca 

jgSgagattcctctggctaaacccag *'''* 
CCP molecule: CCP25 amino acid sequence (kbp6):

MEFGSFLVSLGTSFVPFVTBMLL^ 



LTSSE.< 


"JD-WNLS 




tCSSL^LL 


PTLL PLAAT.DNN I KNTKNAT DTT S KGT 


FSQLD2 


DESMAN I 


TKKS'SRL'WAFLGAVYWISI 


jVTYFFLW 


KAYKHVS SLRAQALMSADVKPEQFAIL 



VRDMPAP P DGQTQKE FIDSYFRE I YPET FYRSL VATENS KVNKI WEKLEG YKKKLARAEAILAAT 



NNRPTNKTGFCGLVGKQVDSIEYYTELINESVAKLETEQKAVLAEKQQTAAVVFFTTRVAAASAA 

QSLHCQMVDKWTVTEAPEPRQLLWQNLNIKLFSRIIRQYFIYFFVAVTILFYMIPIAFVS 

KNLQRIIPFIKPWEITAIRTVLESFLPQIALI^ 

YFYFSVFNVFIGVTLAGTLFNTVKDIAKNPKLDMIINLLATSLPKSATFFLTYVALKFFIGYGLE 
LSRIIPLIIFHLKKKYLCKTEAEVKEAWYPGDLSYATRVPGDMLILTITFCYSVIAPLILIFGIT 
YFGLG WL VL RNQ ALKVYVPS YE S YGRMW PH I HQRI LAAL FL FQ WM FG YL G AKT FF YT AL VI PL I 
ITSLIFGYVCRQKFYGGFEHTALEVACRELKQSPDLEEIFRAYIPHSLSSHKPEEHEFKGAMSRY 
QDFNAIAGV 



FIGURE 28 




CCP molecule: CCP26 nucleotide sequence (kbp9):

aacaataagaagaaaaagtttcattttctggEgcgg^^gaagagt^ccaatatgtggaactg 

ggaggtgactgggttcgaatcgaagaagtcgccttctagtgaggaaggcgttcatcggacaccgt 

cttctatgcttcgacggtactcgatcccgaagaactcgcttccaccgcactcgtcggagcttgcg 

tctaaggttcagagtttgaaggataaagttcagcttgcaaaggacgattatgtgggattgagaca 

ggaagctactgatcttcaagagtactccaatgcgaagcttgaaagggttacacgttatttaggtg 

ttctggctgataaaagtcgtaaactggatcaatatgcacttgagactgaggctaggatatctcca 

cttatcaatgagaagaagagactgttcaatgacttactgacgaccaaaggtgcacatcttccatt 

tccgacgtcattctctatccttacttctattgatattgatcacaccagacccttatttgaagacg 

agggtccctctatcattgaatttcctgataactgcactatacgcgtaaacactagtgatgatact 

ctgtccaatcccaagaaggaatttgaatttgatagagtttatgggcctcaagttggacaagcttc 

actgttcagtgatgtccaaccttttgtgcaatccgctctggatggatcgaacgtttctatatttg 

cgtatggccaaactcacgcggggaagacatacaccatggttgcccctcctttccctttcctctct 

gaaattagatataggtcttgtttggatttaaatatgataggcaagttcatggacgttcatagtaa 

gttcatggacgaaggatctaatcaggaccgtggtttatatgctcgttgttttgaggaacttatgg 

acttggccaattctgattcaacttccgcatctcagttcagtttctctgtttcagtgtttgagctt 

tataacgaacaggtcagggatttactctcgggttgtcagagcaatttgccaaagatcaatatggg 

tttacgcgaatcggttatagaactttcacaggaaaaagttgataatccatcagagttcatgagag 

tcctgaactctgcatttcagaatagagggaatgataaatcaaagtctactgtgacccatctgatt 

gtctcgatacacatttgttatagcaacacaattacgagagaaaatgtaattagcaagctttcttt 

agttgacctggctggaagtgaaggtttaactgtggaggatgacaatggagatcatgtaactgatc 

tgctccatgtaacaaattcaatttccgcgctgggagatgttttatcatctttgacgtcaaaaaga 

gataccattccttacgagaactcatttcttacaagaatacttgcagattcactaggagggagctc 

caaaacattgatgatcgtcaacatttgtccaagtgcacggaacttgtctgaaataatgtcgtgtc 

tcaactatgctgctagagctcgaaatactgtaccaagccttgggaatcgagacacaattaagaaa 

tggagagacgtggcaaatgatgctcggaaggaggtattggagaaagagagggaaaatcagcgtct 

aaaacaagaggttacgggtttaaaacaagcacttaaagaagcaaatgaccaatgtgtactgctct 

ataatgaagtacagagagcgtggagagtttcattcacactgcaatcagatttaaagtcagagaat 

gcgatggttgtagacaaacataaaatagaaaaggagcagaattttcagttaagaaatcaaatagc 

tcaacttttacagttagaacaggaacaaaagctgcaggcgcaacaacaagattccaccattcaaa 

atctccagtctaaagtgaaagacttagaatcacaactaagtaaagctctgaagtctgacatgaca 

agatcgagagatcccttggaacctcagcccagagcagctgagaacacactcgattcttctgcagt 

taccaagaaacttgaggaagaattgaaaaaacgtgatgcactgattgagaggttgcatgaagaaa 

atgaaaaattgttcgacagattaacagaaaagtcagtggctagctcgactcaggtatctagcccc 

tcatcaaaagcttcaccaacagtgcagcctgcagatgttgacaggaaaaa-tagcgcgggcact-tt 

accgtcttcagtggataaaaatgagggcacgattacattagtaaaatccagctctgaattagtaa 

aaaccactccagctggagaatacttaacagctgcattgaatgattttgatcccgaacaatatgaa 

ggtcttgcagccatagctgatggcgcaaacaagcttctgatgctggtcttagcagctgtcataaa 

ggctggtgcttccagagagcatgaaatccttgctgagatcagagattctgtcttttcatttatcc 

ggaaaatggaaccaaggagagtaatggatacaatgcttgtttctcgagt'caggatattgtacata 

aggtccttacttgcacgatcacctgagcttcagtcgatcaaggtttctcctgtt'gaacgcttttt 

ggagaagccatatactggtcgaactagaagctccagcgggagtagcagcccaggtagatcaccag 
ttcgatattatgatgagcagatttatggctttaaagttaatttaaagccagaaaagaaaagtaag
ttggtatctgtagtttcaagaatccgtggacatgaccaggatactgggaggcagcaagtgactgg 
aggaaagctgagggagatacaagatgaagccaaaagttttgccat.tggaaacaaacccttagctg 
ctttattt gt:tcacaGtccggctggtgaactgcaaagacagattaggtcat:ggcttgcagaaagt 
tttgagtttctctctgttacagcag atgatgt-btcaggagtaaccactggccaattagagcttct 
ttccacaqcaattatggatggctggatgg ctggagtaggagct g cggtgcc a cctcacacagacg 
ctttaggacagct 

™ ~ " c t in KBP9 

tt tgtctgagtatgcaaa a cgagtctacacttctcagatgcagcatctaaaggatattg 
2 2 in KBP9 

ccggtacfttggcttcggaagaggcagaagatgctggtcaagtcgcgaagcttcgatcagctctc 
gagtctgttgaccacaaaagaagaaagattttgcaacaaatgagaagtgatgcagctttgtttac 
cttggaagaaggcagttcccctgttcaaaatccatctacagcagccgaagactcgagattagcct 
ccctcatttctctggatgccatactgaagcaagtcaaggaaataacaagacaagcctctgtccac 
gttttgagtaaaagcaagaaaaaggcattgcttgagtctcttgatgaacttaacgaacgaatgcc 
ttctctgcLtgatg-t-tgatcatccatgtgcacagagagaaattgatacggctcaccagttggtcg 
agacaattccagaacaagaggacaatcttcaagacgaaaagagaccttcaatagattcaatatct 
tcgactgaaaccgat:gtgtctcaatggaatgtt:t:tgcaattGaacacaggaggctcttcagct:cc 
attcatcataaaatgcggagctaactccaactcagagcticgtgatcaaagcggatgcccgtattc 
aagaacctaaaggaggcgaaatagtgagagttgtgccaagaccttcggttttagaaaacatgagc 
ttagaggaaatgaaacaagtgtttggtcagttgcccgaagctctaagttcactggccttagctag 
aacagctgatg gcacacgggctcgatactctagactctacagaactctagccatgaaggtt ccct 
ctcttaaaaacctcattaaaaaacttqaaaaaaaaqaaatcttai^qat afe a ^ 
.ti^attagggttttttcgtgaatttgaaa 
CCP molecule: CCP26 amino acid sequence (kbp9):

MAEQKSTNMWNWEVTGFESKKSPSSEEGVHRTPSSMLRRYSIPKNSLPPHSSELASKVQSLKDKV 
QLAKDDYVGLRQEATDLQEYSNAKLERVTRYLGVLADKSRKLDQYALETEARISPLINEKKRLFN 
DLLTTKGAHLPFPTSFSILTSIDIDHTRPLFEDEGPSIIEFPDNCTIRVNTSDDTLSNPKKEFEF 
DRVYGPQVGQASLFSDVQPFVQSALDGSNVSIFAYGQTHAGKTYTMVAPPFPFLSEIRYRSCLDL 
NMIGKFMDVHSKFMDEGSNQDRGLYARCFEELMDLANSDSTSASQFSFSVSVFELYNEQVRDLLS 

GCQSNLPKINMGLRESVIELSQEKVDNPSEFMRVLNSAFQNRGNDKSKSTVTHLIVSIHICYSNT 

ITRENVISKLSLVDLAGSEGLTVEDDNGDHVTDLLHVTNSISALGDVLSSLTSKRDTIPYENSFL 

TRILADSLGGSSKTLMIVNICPSARNLSEIMSCLNYAARARNTVPSLGNRDTIKKWRDVANDARK 

EVLEKEP^NQRLKQEVTGLKQALKEANDQCVLLYNEVQRAWRVSFTLQSDLKSENAMWDKHKIE 

KEQNFQLRNQIAQLLQLEQEQKLQAQQQDSTIQNLQSKVKDLESQLSKALKSDMTRSRDPLEPQP 

FAAENTLDS SAVTKKLEEELKKRDALIERLHEENEKLFDRLTEKS VAS S TQVS S PS SKAS PTVQP 
ADVDRKNSAGTLPSSVDKNEGTITLVKSSSELVKTTPAGEYLTAALNDFDPEQYEGLAAIADGAN 

KLLML VLAAV I KAGAS RE HE 1 LAE I RD S VF S F I RKME PRRVMDTML V SRVRI L Y I RS LL ARS PEL 
QS IKVS PVERFLEKPYTGRTRSSSGSSSPGRS PVRYYDEQI YGFKVNLKPEKKSKLVSWSRIRG 



QSIKVSPVERFLEKPYTGRTRSSSGSSSFCjKSf VKi lutyi. i<jr rvvwx.rvirE.r>.i^.-.*>.* J v ^ > » — - — 
HrQrT ^p Q nw.r^T f T.PP.TOr>P.aTC.qFATGNKPLAAL FVHTPAGELQRQIR SWLAESFEFLSVTADD 
VSGVTTGQLELLSTAIMDGWMAGVGAAVPPHTDALGPLLSEYAKRV YTSQMQHLKDIAGTLASEE 

P in KBP9 

A.E DAGOV AKLRS ALE S VDHKRRKI LQQMRS DAAL FTLEEGS S PVQN PS TAAE PS RLAS L I S L DAI 
LKOVKE I TRQAS VHVL S KS KKKALLE S L DELNERMPSLLDVDHPCA ORE I DTAHQL VET I PEQED 
NLODEK RPSIDSISSTETDVSQWNVLQFNTGGSSAPFIIKCGANSNSE LVIKADARIQEPKGGEI 
VRWPRPSVLENMSLEEMKQVFGQLPEALSSLALARTAD GTRARYSRLYRTLAMKVPSLRDLVGE 

LEKGGVLKDTKST 
A.

CCP molecule: CCP27 nucleotide sequence (kbp11):

ttagttagataggcggi-g^^ 
cggtggagttgagagtccagtgacgtcatcag^ 

acaacccaccgactatgactcgttcggatccaagattggaccatgacttcaccaccaacaacagt 
ggaagccctaatacccagactcagagccaagaagaaGagaacagcagagacgagcaaccagctgt 
tgaacccggatceggatccgggtctacgggtcgtcgtcctagaggtagacctcctggttccaaga 
acaaaccaaagagtccagttgttgttaccaaagaaagccctaactctctccagagccatgttctt 
gagattgctacgggagctgacgtggcggaaagcttaaacgcctttgctcgtagacgcggccgggg 
cgtttcggtgctgagcggtagtggtttggttactaatgttactctgcgtcagcctgctgcatccg 
gtggagttgttagtttacgtggtcagtttgagatcttgtctatgtgtggggcttttcttcctacg 
tctggctctcctgctgcagccgctggtttaaccatttacttagctggagctcaaggtcaagttgt 
gggaggtggagttgctggcccgcttattgcctctggacccgttattgtgatagctgctacgtttt 
gcaa-tgc cacttatgagaggttaccgattgaggaagaacaacagcaagagcagccgcttcaacta 
gaagatgggaagaagcagaaagaagagaatgatgataacgagagtgggaataacggaaacgaagg 
atcgatgcagccgccgatgtataatatgcctcctaattttatcccaaatgg tc atcaaatggctc 
aacacgacgtgtattggggtggt cctcc^ 

" ^ ^ in KBP11 a 

ggcggtggttggtgcgttctttt-bactggaatgattatattttcca-ttaggatggttaggctttt 
gtt t attaaagctatcaagtttctttttttitt t acggataattcggatgacaattagctagtgtt 

- in KBP11 - in KBP11

tgtttgtttgttttgtggc - ggcttttctg a cttgactattttgatcgcggatagctttgtatga 

c -in KBP11 

aagtgaattgattgtagaatcgtcttttgaattttgatgttggaaaaaaccaagcaatggtgtgt 

gg c ctttgcaatggaagc 
n in KBP 11 



B. 

CCP molecule: CCP27 amino acid sequence (kbp11):

MANPWWVGNVAIGGVESPVTSSAPSLHHRNSNNNNPPTMTRSDPRLDHDFTTNNSGSPNTQTQSQ 
EEQNSRDEQPAVEPGSGSGSTGRRPRGRPPGSKNKPKSPVWTKESPNSLQSHVLEIATGADVAE 
SLNAFARRRGRGVSVLSGSGLVTNVTLRQPAASGGWSLRGQFEILSMCGAFLPTSGSPAAAAGL 
TIYLAGAQGQWGGGVAGPLIASGPVIVIAATFCN ATYERLPIEEEQQQEQPLQLEDGKKQKEEN 
DDNESGNNGNEGSMQPPMYNMPPNFIPNGHQMAQHDVYWGGPPPRAPPSY 
CCP molecule: CCP28 nucleotide sequence (kbp12):

aatttgctttatctttgcattgttgttggcg^ 

tgtaatccggatgttgaatctgaaccaacctttgaatccaagtggaactgcgaacgaagaagttt 
acaagatcttgatttacgataggttttgtcagaacattctatctccattgacccatgtcaaggat 
ctgcgtaagcatggagttacactcttctttctcatagacaaagatcgacaacctgttcatgatgt 
tcccgctgtctactttgttcaaccaactgaatccaacctccagaggatcatagccgatgcttcta 
gatctctctacgatacctttcatctgaatttctcgtcttcgatccctcgtaagtttcttgaagag 
ctagcttctgggactcttaaatctggttctgttgagaaagtctcgaaagtgcatgatcagtatct 
ggagtttgtgactttggaagataacttgttctcgctggctcagcaatctacctatgttcaaatga 
atgacccatcagcaggggagaaagagattaatgagattatcgaaagggtcgctagtggtttgttt 
tgtgtgttggtaacgcttggtgtggttcctgttatccgatgccctagtggtggacctgcagagat 
ggtggcgtctttgttggatcagaaactgagggatcatcttttgtccaagaacaatctgtttactg 
aaggtggcggtttcatgagctcgtttcagcgtcccctcttgtgcatatttgataggaactttgag 
ctctcggttgggattcagcatgat-ttcagataccggcctctcgttcacgatgttctcgggttaaa 
gctcaaccaattgaaagtgcagggagagaaaggaccaccgaaatcgtttgagctggacagttcgg 
acccattctggtcagcaaacagtactctggagtttccagatgtcgctgtggagatcgaaacacag 
ttgaacaagtacaagagagacgttgaagaggttaacaagaaaaccggaggtgggagcggcgctga 
gtttgatgggacagatctgattggaaacatccacaccgagcatctcatgaacactgtgaaatcgc 
tcccggagttaactgagcgaaagaaagtgattgacaaacacaccaatatcgcaacagcgctctita 
ggacagatcaaggagagatctattgacgctttcactaagaaagaaagcgacatgatgatgagggg 
cggaatcgacagaactgaacttatggctgctctgaaaggcaaagggacaaagatggacaagctcc 
ggtttgcaatcatgtacctgatctccacagaaaccataaaccaatcggaagttgaagcagtggag 
gcagcattgaatgaagctgaggctgatacaagtgcgtttcagtatgtaaagaaaatcaaatcgtt 
aaacgcatcttttgcagctacatcagcgaatt.cagctagcagaagcaacattgtagactgggccg 
agaagctttacggacagtctataagcgcagtgactgcaggagtcaagaatctgttatctagtgat 
caacaattggcagtgactcgaacagtcgaagctttaacagaaggaaaaccaaacccggagatcga 
ttcttaccgcttcctggacccaagagctccaaagtcgtctagctccggtggtagccatgtaaaag 
gaccgttcagagaagctatagtgttcatgatcggtggaggtaactatgttgagtatggaagtttg 
caggagttgactcagagacagttaaccgttaaaaacgttatttatggagccactgagattcttaa 
cggaggtgagttggtggagcagcttggacttttgggaaagaagatgggattaggaggtccggtcg 
cttcaacgctgaagaggctgggaatggctggtaaagaggagactgatgtatctgcacaagggtct 
ttaaccagggaggccactgagatatggaggagtgagttggaatctcgccggtttcaggtagatag 
tttagaagctgaacttgtggatgtcaaggcttaccttgagtttggctcagaag^

aggagttaggagttctttcgggtagggtcaga.tcgactgcaactatgttgcgttatttaasatca 

aaagctagagtcttggccattcctgatgatctagcaaat:gtgt:cat:gcggtgtggaacagattga 

agaactgaaaggattgaaccttgttgagaaagatggtggttcatcttcttctgacggggctagga 

acactaatcctgaaactagaaggtacagtggttccttgggtgtagaggatggagcctatiactaat 

gaga-tgctccagtccatagagatggtt-actgatgtgc-tggactctcttgtgaggagggttacagt 

agcagaat ctgagtctgctgt t caaaaggagagggcacttttgggagaggaa - gaaatcagtagg 

- in KBP12 ^ — — ~ ^ ^ KBP12 

aa- gactatccaaatcgaaaatttgtccgtgaagttagaagagatggaacgatttgcttatggga 
a in KBP12 

ctaatagtgttctaaacgaaatgcgggaaaggattgaggaattagttgaagagacgatgaggcag 
agggaaaaagctgtggaaaacgaa g aggagttgtgt c gtgtgaagagagagttcgagtc g ct taa 

in KBP12 n nn n 

aagctacgtcagtacttttaccaatgttcgagaaacacttctttcgtccgagagacaattcaaaa 
ccattgaggagctctttgaacggttggtcactaagacgacacaattagaaggggagaaggcacaa 
aaggaggttgaagtacagaaactgatggaggagaatgtgaaattgacagcact^^ 
agaggctcagcttctagctttgaatgaacaatgcaaagttatggctttgagtg^^ 
^ct:ct:ctaatecaaccgaatctcaagcttcc 
CCP molecule: CCP28 amino acid sequence (kbp12):

MALNLRQKQTECVIRMLNLNQPLNPSGTANEEVYKILIYDRFCQNII.SPLTHVKDLRKHGVTLFF 
L I DKDRQP VHDVPAVYFVQPTESNLQRI IADASRSLYDTFHLNFS S S I PRKFLEELASGTLKS GS 
VEKVSKVHDQYLEFVTLEDNLFSIiAQQSTYVQMNDPSAGEKEINEIIERVASGLFCVLVTLGWP 
VIRCPSGGPAEMVASLLDQKLRDHLLSKNNLFTEGGGFMSSFQRPLLCIFDRNFELSVGIQHDFR 
YRPLVHDVLGLKLNQLKVQGEKGPPKSFELDSSDPFWSANSTLEFPDVAVEIETQLNKYKRDVEE 

VNKKTGGGS GAE FDGTDL I GNIHTEHLMNT VKS L PELTERKKVI DKHTN I ATALLGQ I KERS I DA 
FTKKESDMMMRGGIDRTELMAALKGKGTKMDBCLRFAIMYLISTETINQSEVEAVEAALNEAEADT 
SAFQYVKKIKSLNASFAATSANSASRSNIVDWAEKLYGQSISAVTAGVKNLLSSDQQLAVTRTVE 
ALTEGKPNPEIDSYRFLDPRAPKSSSSGGSHVKGPFREAIVFMIGGGNYVEYGSLQELTQRQLTV 
KNVIYGATEILNGGELVEQLGLLGKKMGLGGPVASTLKRLGMAGKEETDVSAQGSLTREATEIWR 
SELES RRFQVD S LE AELVDVKAYLE FG SEE D ARKELG VL S GRVRS T ATMLRYLRSKARVLAIPDD 
LANVSCGVEQI EELKGLNL VEKDGG S S S S DGARNTNPE TRRYS G S LGVE DGAYTNEMLQS I EMVT 
DVLDSLVRRVTVAESESAVQKERALLGEEEISRKTIQIENLSVKIiEEMERFAYGTNSVLNEMRER 
IEELVEETMR QREKAVENEEELCRVKREFESLKSYVSTFTNVRETLLSSERQFKTIEELFERLVT 
KTTQLEGEKAQKEVEVQKLMEENVKLTALLDKKEAQLLALNEQCKVMALSASNI 
A.

CCP molecule: CCP29 nucleotide sequence (kbp13):

ATC GCCAT GGCTGj^T GCT CT CAKATCT CTTGAG ATT GT T GAT GGT CT T G AT G AAT A 
•GATGAATCAATCTGAATCCAGTGCTCCGCATTCTCCAACCAGTGTAGCAAAGCTGCCACCAAGCA 
CTGCAACTAGAACAACTCGACGGAAGACCACAACAAAAGCTGAGCCTCAGCCATCATCTCAGTTG 
GTGTCCCGTTCTTGTGGTTCGACGAGCAAGTCTCTTGCTGGAGATATGGACCAGGAAAACATAT^A 

caagaatgttgctcaagaaatgaagactagcaatgtcaagtttgaagccaatgtgctcaaaactc 
cagcagcaggaagcacaaggaaaacttcagcagcaacttcttgcactaagaaggatgaattggtc 
cagtcggtctacagcactaggagatgaaccaggctgttagagaaatgtatggccgatctgagttt 
gaagactaaagaaactgtggataataaacctgccaagaatgaagatacagaacagaaagtatctg 

cacaggagaagaatctaactggt223 



B. 

CCP molecule: CCP29 amino acid sequence (kbp13):

MTNIAMADALKSLEIVDGLDEYMNQSESSAPHSPTSVAKLPPSTATRTTRRKTTTKAEPQPSSQL 
VSRSCRSTSKSLAGDMDQENINIO^VAQEMKTSNVKFEANVLKTPAAGSTRKTSAATSCTKKDELV 
QSVYSTRRSTRLLEKCMADLSLKTKETVDNKPAKNEDTEQKVSAQEKNLTG 
A.

CCP molecule: CCP30 nucleotide sequence (kbp15):

agcacccttgcatcctctcagaaagaagatgcagattcagcacgcttaccagcagatacatcagg 
ggtcaaaactgttgaagatggaccggatgatgttgagagggaccaaaaja|gaaggag^gcgtgag 
gaaaggaaacctgcaaagagagagaaggaagaaagacatgataggcgtgaaaaacgcgaaaggca 
tgagaagcgaagcgctcgtgattcagatgatagaaagaagcacaagaaagagaagaaggagaaaa 
aaagaaggcatgactctgattctgattgaagcgaattgtcccaggatggaacattttgctcttca 
gaggaagagtggtcggctaggtaccaaaatccagctaccacttctgcaagatttaaatctgttgc 
ttatttcatttacgaatcgtggagtaaagt^tg^gg 

B. 

CCP molecule: CCP30 nucleotide sequence (kbp15):

ggctgataaatatagggagaactatttgggtcacagtatcaaagcccctgttggaagatggcaaa 

aaggtaaagatcttcattggtgggctagagataaaaagcaaaagggttccgagSSgatgctSgj 

aaagaagagattcaaagagttaaggaacaagaggagcaggccatgagggaggctcttggcttggc 

accaaagtcctct:acaaggccacaaggaaatcgccttgataagcaagagt:ttactgaacttgt^ 

agaggggttcgacagcagaggacttaggtgcagggaafcg^ 

ggatatgctaaagcaccacgaccttgggaagatccgagcacccttgcatcctctcagaaagaaga 
tgcagattcagcacgcttaccagcagatacatcaggggtcaaaactgttgaagatggaccggatg 
atgttgagagggaccaaagaaggataggcgtgaggaaaggaaacctgcaaagagagagaaggaag 
aaagacatgataggcgtgaaaaacgcgaaaggcatgagaagcgaagcgctcgtgattcagatgat 
agaaagaagcacaagaaagagaagaaggagaaaaaaagaaggcatgactctgattctgattgaag 
cgaattgtcccaggatggaacattttgctcttcagaggaagagtggtcggctaggtaccaaaatc 
cagct^cacttctgcaagatttaaatctgttgcttatttcatttacgaatcgtggagtaaa 

ccgagacgctcgttttagct 

C. 

CCP molecule: CCP30 amino acid sequence (kbp15):

MLMLCGFTVLDMLKHHDLGKIRAPLHPLRKKMQIQHAYQQIHQGSKLLPGyiDRMMLRGTRRRIGVR 
I GNLHRESRKEDMI GVKNAKGMRSEALVI QMIERSTRKRRRRKKEGMTLILI EANCPRMEHFALQ 
RKS GRLGTKI QL PLLQDLNLLLI S FTNRGVKCC 

D. 

CCP molecule: CCP30 amino acid sequence (kbp15):

MDAMKEEIQRVKRQEEQAMREALGLAPKSSTRPQGNRLDKQEFTELVKRGSTAEDLGAGNADAVW 
VHGLGYAKAPRPWEDPSTLASSQKEDADSARLPADTSGVKTVEDGPDDVERDQRRIGVRKGNLQR 
ERRKKDMIGVKNAKGMRSEALVIQMIERSTRKRRRRKKEGMTLILIEANCPRMEHFALQRKSGRL 
GTKIQLPLLQDLNLLL IS FTNRGVKCC 
CCP molecule: CCP31 nucleotide sequence (kbp20):

GCAAAAGAGAGAAA^TCTX^^ 

"AAGATGGTCTGAGCGACGA^GAGG^CTCGTCGATGmGGAAGCCTTGATGCTCTT.CGACACCGAT 
•GGGG&CGHG®^^ 

CCAAGCCCAGCTGAAATCCATAATCGCATCCGAGAATCTCTCTTCACCGTTTGATTTCAACAGAT 
TCCTCGATCTCATGGCGAAACATCTGAAGACGGAACCTTTGGATCGCCAGCTCCGTGACGCATTC 
AAAGTGCTCGATAAGGAAGGTACCGGGTTCGTTGCTGTGGCGGATCTGAGGCATATTCTGACCAG 
TATCGGAGAGAAGCTGGAGCCTAATGAGTTCGATGAGTGGATCAAG GAGG TGGATGTTGGATCCG 
ATGGAAAGATCCGGTATGAAGATTTCATAGCAAGGATGGTTGCTAAGg^GATCTAATCTTTTAT 
GTTTTGAAAGTTGAAATTTTTAAGAAGAGATTCTTTTGNGGTTTTTTCACTTGGTTGGTTTGATT 
TCGAGCGAATCCTAACTAGGGGTTGGTTTATCATTGNGGAATTTGCTTACTAACTTTGGCTTCTT 
CATGGTTGGGTTTCAATTTTTAATGGNAAATGGTGGCTGGGGGAATTCCTAAAAAAAAAAAAAAA 
AAAAAAAA 
CCP molecule: CCP31 amino acid sequence (kbp20):

MGKDGLSDDQVSSMKEAFMLFDTDGDGKIAPSELGILMRSLGGNPTQAQLKSIIASENLSSPFDF 
NRFLDLMAKHLKTEPFDRQLRDAFKVLDKEGTGFVAVADLRHILTSIGEKLEPNEFDEWIKEVDV 

GSDGKIRYEDFIARMVAK 
A.

CCP molecule: CCP32 nucleotide sequence (E2F5BBC16):
caaaaaaaaaaatcact.tc aBBB gaaaaacaaaatect^ 

caacttctcaattgcgt cgcggagtctcctttcgatcaagagaaatgcgtccgatttttgcaatc 
tctcagagaatgcgtitctatcaaagaaagtaaagaagttctcgataccgagtcaagatcacgact: 
ctgagggagcagcttcag-ctecaaagagaccttcat^^cgttctttgttccgattttcttttatc 
gtttgagttgtaat:catgtaattgattttaatgtcatgccttggattcataagctgggtca-tgcc 
ttgtttcccctttgttgtcttgtatgttgaatattgcaaactctaaagagcatatttataagaag 
aaataaaagtttctacaaaaaaaaaaaaaaaaaaaa 

B. 

CCP molecule: CCP32 amino acid sequence (E2F5BBC16): SEQ ID NO:126 

MEKQSTQPICGQEALQLLNCVAESPFDQEKCVRFLQSLRBCVLSKKVKKFSIPSQDHDSEGAASA 
TKRPS 

C.

CCP molecule: CCP32 amino acid sequence (E2F5BBC16): SEQ ID NO:98 

RGVSFRSREMRPIFAISQRMRSIKESKEVLDTESRSRL 



FIGURE 38 




A. 

CCP molecule: CCP33 nucleotide sequence (DP):

tagtactaggtcttggggcacggcggtttcaggtcaatctgtgtctactagcggcagtatgggct 
ctccgtcgagccggagtgagcaaaccatcaccgttgttacatctactagcgacactacttttcaa 
cgcctgaataatt -ggacattcaaggtgatgatgctggttctcaaggagcttctggtgttaagaa 
gaagaagaggggacagcgtgcggctggtccagataagactggaagaggactacgtcaatttagta 
tgaaagtttgtgaaaaggtggaaagcaaaggaaggacaacttacaatgaggttgcagacgagctt 
gttgctgaatttgcacttccaaataacgatggaacatcccctgatcagcaacagta-tgatgagaa 
aaacataagacgaagagtatatgatgctttaaacgtcctcatggctatggatataatatccaagg 
ataaaaaagaaattcaatggagaggtcttcctcggacaagcttaagcgacattgaagaattaaag 
aacgaacgactctcacttaggaacagaattgagaagaaaactgcatattcccaagaactggaaga 
acaatatgtaggccttcagaatctgatacagagaaatgagcacttatatagctcaggaaatgctc 
ccagtggcggtgttgctcttccttttatccttgtccagactcgtcctcacgcaacagtagaagtg 
gagatatcagaagatatgcagctcgtgcattttgatttcaacagcactccatttgagctccacga 
cgacaattttgtcctcaagactatgaagttttgtgatcaaccgccgcaacaaccaaacggtcgga 
acaacagccagctggtttgtcacaatttcacgccagaaaaccctaacaaaggccccagcacaggt 
ccaacaccgcagctgga-atgtacgagactcatcttcaatcgcaacaacatcagcagcattctca 

gctacaaatcattcctacgcctgagactaacaacg;^ 
aatccccgtctcttccagggataatgaactccagca^ 



B. 

CCP molecule: CCP33 amino acid sequence (DP): 

MT T T G £ frl S N HN HHE SNNNNNN P S T RS WGT AV S S VS T S G SMG S ^ ^^^^^^^^^^'1'^'^^ 
RLNNLDIQGD DAG S QGAS G VKKKKRGQRAAGP DKT^^S^S!^BSSSS^^3SuS^SSiS 
EgJHBAL PNNDGT S PDQQQYtmm^iECk^^ SLSDIEELK 
RE^SLRN RIEKKTA^^^ 

gjSg^ffi^^^^^^^^^^^^^^KFCDQPPQQPNGRNNSQLVCHNFTPENPNKGPSTG 
PTPQLDMYETHLQSQQHQQHSQLQIIPMPETNNVTSSADTAPVKSPSLPGIMNSSMKPEN 
A.

CCP molecule: CCP35 nucleotide sequence

gatggttaccaaaaccgaaggcaaaggtaatggcattaagaccaacattatcaacaatgttgaga 

ttgccaaagccttggctagaccgccttcttatacgaccaagtactttggttgtgagcttggagc 

cagtctaagtttgatgagaagactgggacgtcgcttgtgaatggagctcacaacacgtctaagct 

tgctgggcttttggagaattttattaagaagtttgttcagtgttatggatgtggtaacccggaga 

ctgagatta-ttattacgaagacgcagatggtgaatctcaagtgtgctgcttgtgggtttatctct 

gaggtcgacatgagggataagttgactaatttcattctcaagaacccacctgagcagaagaaggt 

gtcaaaggataagaaagcaatgaggaaagctgagaaggagaggcttaaagaaggcgagctagctg 

atgaggagcagagaaagctgaaagctaagaagaaagcattgtctaacggcaaggattctaagacg 

tctaagaaccattcttctgatgaggatataagcccgaagcatgatgagaatgctctagaggtgga 

tgaggatgaagatgatgatgatggtgtcgagtggcaaactgatacttcccgagaagctgctgaga 

aaagaatgatggaacagttgagtgctaaaactgccgaaatggtgatgctctctgcaatggaagta 

gaagagaaaaaggcgcccaaaagcaaatctaacgggaacgttgtgaaaactgagaatcctcctcc 

qcaaqaqaaqaatctcqtgcaggatatgaa agagtatctgaagaaagggtcaccaataagcgcgc 

tcaaaagtt-tcatctcgtctctctctgaacctcctcaagacatcatggacgcactcttcaatgct 

ctctttgatggtgtgggaaagggattcgccaaagaagtgactaagaagaagaattacttagcggc 

tgctgcaacaatgcaagaggatggatcacagatgcatctgctcaattcgattgggacattctgtg 

gaaagaatggaaacgaagaagctttgaaagaggtggctctggt:t:cttaaagcattgt:acgaccaa 

gacatcattgaggaagagg-tagtgttggattggtacgaaaagggtctcaccggagctgacaaaag 

ctcgccggtttggaagaatgttaagccttttgtggagtggcttcagagcgctgagtctgagtccg 

aacrac^^ 

ttcgtcagttttcagaattcttaaatctttttgctgtgttcttataaagaaacatcatctattaa 
agttgtcttcgtttggatttggttttgacgactttgggaaatatttatgtttaagaaaaaaaaaa 
aaaaaaaaaaa 



B. 

CCP molecule: CCP35 amino acid sequence 

MALQNIGASNRNDAFYRYKMPKMVTKTEGKGNGIKTNI INNVTSIAKALARPPSYTTKYFGCELGA 
QS KFDEKT GTS L VNGAHNT S KL AGLLENF I KKFVQC YGCGN PETE I 1 1 TKTQMVNLKC AACG FI S 
EVDMRDKLTNFILKNPPEQKKVSKDKKAMRKAEKERLKEGELADEEQRKLKAKKKALSNGKDSKT 
SKNHSSDEDISPKHDENALEVDEDEDDDDGVEWQTDTSREAAEKRMMEQLSAKTAEMVMLSAMEV 
EEKKAPKSKSNGNWKTENPPPQEKNLVQDM KEYLKKGSPISALKSFISSLSEPPQDIMDALFNA 
LFDGVGKGFAKEVTKKKNYLAAAATMQEDGSQMHLLNS I GTFCGK1SJGNEEALKEVALVLKAL YDQ 
DIIEEEWLDWYEKGLTGADKSSPVWKNVKPFVEWLQSAESESEEED .» .'■ 1 
CCP molecule: CCP36 nucleotide sequence

atacgcttttctcgaatggtcactcattttcttcattttgctcaactctctcttttcttatttca 
tactcagattcgctgattatttcggtcttaaacgtccttgtctcttctgctctagactcgatcgt 
ttcttcgatgcttctggtaaatctccttctcatcgagatcttctctgcgatgatcatgctctcca 
attacattcaaaacctgttgaagaatctaattgtggtttcggagaatttcacaatgatttggttc 
atcgtggttgttgcgtagagaagataagttcgtcactatgtgctccgattgagtctgactttggg 
aatttagattatccaattggagatgaaggtcagatttacaatggtcttaagtttcctcgatcgat 
cttcgtctttgaagaagagaaagtaggatctgtaaatttgaatgattctcaggaagaaacagagg 
agaagaaagttccccaatctcatgagaaacttgaagatgatgatgttgatgaggagttttcatgc 
tatgtatcaagcttcgattgtaagaacaaagaaattgcaacagagaaggaagaagaaaacagagt 
ggatctacctatagaggtggaaactgcagaatcagctccgaaaaacctcgagttctatattgatg 
aagaagactgtcatttgattccagttgaattctataaaccgagtgaagaagttcgagagatttcc 
gacattaacggagattttatcctcgatttcggcgttgagcatgatttcacggcggcggcggagac 
ggaggaaatctccgactttgcttcgccgggtgaatcgaaaccggaggatgcagagacgaatctag 
ttgcttcggaaatggaaaacgacgacgaagaaacagacgcagaggtttctataggtacagagatt 
cctgatcatgagcaaatcggagatattccttctcaccagctcattcctcaccacgatgacgatga 
tcatgaggaggaaacgttggagttcaaaacagtaacgattgaaaccaagatgccagtcttaaaca 
tcaacgaagagcggattttagaagctcaaggctcgatggaaagctcgcatagtagtctacataac 
gctatgtttcacttagagcaaagagtatctgttgatggtattgaatgtcctgaaggagtactcac 
tgttgataagttgaagtttgagttacaagaagagagaaaagcacttcacgcgttatacgagga£c 
tggaggtagagaggaatgcgtctgctgttgctgccagtgaaacaatggcgatgatcaataggttg 
catgaggagaaagctgcgatgcagatggaagcgttgcagtatcagagaatgatggaggagcaagc
tgagtttgatcaagaagctttgcagttgttgaatgagcttatggtgaatagagagaaggagaatg
ctgagcttgagaaggagctagaggtgtatagaaagagaatggaggagtatgaagctaaagagaaa
atggggatgttgaggaggagattgagagattcctctgttgattcgtatagaaataatggcgattc
tgatgagaat.agcaatggagagttacagtttaagaacg-btgaaggggtt:acggat1:ggaaatata 
gagagaatgagatggagaatacgccggtggatgttgtacttcgtcttgatgagtgtttagatgat 
tatgatggagagaggctttcgattcttgggagattgaagtttcttgaagagaaactcacagatct 
taataacgaagaggacgacgaggaggaggctaaaacgtttgagagtaatggtagcatcaatggaa 
atgagcatattcatggcaaagaaacaaacgggaagcacagagttatc a agtcaaa g agatta 

in E2F3ca2: c a 
cttcccctgtttgatgcggtcgatggagagatggaaaacgggttaagtaacggaaaccatcacga 
aaacgggtttgatgattcggagaagggtgagaatgtgacgatagaagaagaagtggatgagcttt 
acgagaggttagaagctctagaggcagatagagagttcttaagacattgtgttggttcattgaaa 
aaaggagacaaaggtgtacatctcctccatgagattcbgc aac atcttcgtgatctaaggaatat 
cgatcttactcgcgtcagagaaaacggagacatgaqttt^ 

tgagtccactcttt:gcatagtgacccaaagaacaagaaaaatcatacaggtatggaagtgacat:g 
ttgcttgtgaggcaaggaacaacgacaaggtttcagatqaagaagaaaacgttctcagaataaaa 
gtattttaagtatat:actctgaggaaaagtgtcaqatcagaatgttcgtctttct1:cgttcattt 
tcattattataagttttgttttttatattgaaqatttatttagagagagggaagtgtcagtataa 
tttcacttttatattttatatttgggagttgtctt-tatgagtgqtggtaatagaaaaaggtagaa 
tgatgagtgaagaaaaaaaaaaaaaaaaaaaaaa

CCP molecule: CCP36 amino acid sequence

MAftNKFATLIHRKTNRI TLI LVYAFLEWSL I FFI LLNS LFS YFI LRFAD YFGLKRPCLFCS RLDRFFDAS G 
KSPSHRDIiLCDDHALQLHSKPVEESNCGFGEFHNDLVHRGCCVEKISSSLCAP.IESDFGNLDYPIGDEGQI 
YNGLKFERSIFVFEEEKVGSVNLND 

ENRVDL P I E VET AE S APKNLE F Y I DEE DCHLT PVE F YKP S EEVRE IS D ING DF I LDFGVEHDFT AAAETEE 
ISDFASPGESKPEDAETNLVASEMENDDEETDAEVSIGTEIPDHEQIGDIPSHQLIPHHDDDDHEEETIiEF 

KTVTIETKMPVLNINEERILEAQGSIffiSSR^ 

LHAL YEE LEVERNAS AVAAS E TMAM I NRLH EEKAAMQMEALQYQRMMEEQAE F DQEALQLLNELMVNREKE 
NAELEKELEVYRKRMEEYEAKEKMGMLRRRLR 

ENTPVDWLRLDECLDDYDGERLSILGRLKFLEEKLTDLNNEEDDEEEAKTFESNGSINGNEHIHGKETNG 
KHRVIKSKRLLPLFDAVDGEMENGLSNGNHHENGFDDSEKGENVTIEEEVDELYERLEALEADREFLRHCV 
GSLKKGDKGVHLLHEILQHLRDLRNIDLTRVRENGDMSL

CCP molecule: CCP36 nucleotide sequence

grajtcagacgctctttctgcgattccggccgcagttcatcgcaatctctccgataaactctatga 
gaagcgcaaaaatgctgcgcttgagcttgagaatattgtgaagaatctaacttcttcgggtgatc 
atgacaagatctcgaaagtcattgagatgttgattaaggaatttgccaaatctcctcaagctaat 
catcggaagggtggtctaattggcttagctgctgtaactgttggtttgtctacagaagctgctca 
atatcttgagcaaatagtgccacctgtgattaattccttttctgatcaagatagccgagttcggt 
actatgcatgtgaagctctctataacattgcaaaggttgtgcgaggcgatttcattattttcttc 
aat.aaaatatttaatqc cttatqcaaactctcagcaqattctq atqccaatqtccaaaqtgctgc 
tcatcttttqqatcqccttqttaaqqatattqtqacgqaaaqtqat caqttcagtattgaggaat 
tcatacctcttttaaaagaqcqaatqaacqttctcaacccttacqt ccqqcaatttctgqttgga 
tqqatcactgttcttqataqtqttccaqacattgacatqcttgg qtttctqccaqactttctcga 
tqqqttattcaatatqttqaqcqactctaqtcatqaaatacqacag caaqctqattcaqctcttt 
caqaqtttcttcaagagataaaaaattcaccatctqtaqatta tqqtcqcatqqctqaaatactg 
qtqcaqagqgctqcttctcctqatqaattcactcqattaacaqccatcacgtqqataaacqa gtt 
cqtaaaacttqqqqqaqaccaqctcgtgcqttattatqctqacatt cttqqqqctatcttgcctt 
qcatatctqacaaagaaqaqaaaatcaqqqtqqttq ctcqtqaaaccaatqaaqaacttcgttca 
atccatqttqaaccctcaqatqqttttgatqttqqcqcaattct ctctqttqcaaqqagqcagct 
atcaagtqaqtttqaqqctiactcqqattqaaqcattqaattqq atatcaacacttttaaacaagc 
atcgtactqaqqtcttqtqcttcctqaatiqacatatttqacac ccttctaaaaqcactatctgat: 

■~ in E2F3C3.9 

tcttctqatqacqtqgtgctcttqqttctqqaqqttcatqctqqtq taqcaaaaqatccacaaca 
ctttcqccaqctcatcgtatttcttqtccacaatttccqaqctqataattctcttttqgaaa ggc 
qcqgtqcccttattqtccqaaqaatgtqtqtacttttqqatqcc qaaaqaqtctaccqagagctc 
tctacaattcttqaqqqaqaaqataatcttqactttqcttctacca tqqttcaqqcattgaattt 
qattttqcttacttccccqqaqttatcqaaactqaqaqaactattaaa aqqt-tcactcqtcaatc 
qcqaaqggaaaqaacttttcqttqccttqtatacttcatqqtq ccattcacccatqg-caattat 

aaqcctctqcttattaqctcaqqcttacca - acatqcgaqtqtcgt gattcaatcattggtagaa 

in E2F3ca9 a a c c a 

qaaqacattaacgtc - aaatttct - aqtacaqcttqataaa -ttqatccqgcttctqgaaactcc 

c t gc a in E2F3ca9 

aatctttacttaccttaqattqcaqcttctggaaccaqqaaqqtacacat qqttqctgaaaaca:c 
tttatggtcttcttatqttacttcctcagcaaaqtqcgqcqttcaaqatacttaqqacaaga ctc 
aaaactgtqccaacqtactcattcaqtactqqaaaccaaata qqcaqaacaacttcaggagttcc 
tttctctcaqtataaqcatcaaaac'qaqqacggtqacttaga aqacqataacatcaacagttctc 
accaaqqaatcaattttqctqtgcqqctacaacaqttcgaaaacqtaca qaatctacatcqtggc 
caqgcaaqqactaqagtqaactactcatatcactcttcctcttcttctaca tcaaaqqaqqtqag 
gaqatctqaaqaacaacaacaqcaqcaqcaqcaacaacaacaqcaacaa caacaacaacaacgac 
caccaccttcttcgacatcatcatcaqttqcaqataacaataqa cctccatcaaqaacttcaaga 
aaaggccctqqtcaattacaqcttH^cctaccLqgtaatcata aataataaataatattccatc 
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cccqacaatcatcatcttcatcttct^^ 

ttgtatatctct:ct:ttt:ttagtaact:cttcaagttt:cgacggaactt:gtggaaaagctacggtcg 

tgt^ca^cat^ctc^ttc^ 

agtctacctttatattgt^ 

ctttttatatttttaccttatgtggagatgtaagaaaaagaagtga-tcatgtggttttgtgttgt 
ttttttataac-tggaaaaccacatgagtttgtagaggtcacttattggatattttatgtcaaatg 
atgctcctttttacaaaaaaaaaaaaaaaaaa 



FIGURE 43 (continued)

CCP molecule: CCP36 amino acid sequence

MSDiyjSAIPi^VHRNLSDKLYEKRKNAM.ELENIVKNLTSSGDHDKISKVIEMLIKEFAKSPQAN 
HRKGGLIGLAAVTVGLSTEAAQYLEQIVPPVINSFSDQDSRVRYYACEALYNIAKWRGDFIIFF 
NKI FD ALCKLSADSDANVQSAAHLLDRLVKDIVTESDQFSIEEFIPLLKERMNVLNP YVRQFLVG 
WITVLDSVPDIDMLGFLPDFLDGLFNMLSDSSHEIRQQADSALSEFLQEIKNSPSVDYGRMAEIL 
VQRAASPDEFTRLTAITWINEFVKLGGDQLVRYYADILGAILPCISDKEEKIRWARETNEELRS 
IHVEPSDGFDVGAILSVARRQLSSEFEATRIEALNWISTLLNKHRTEVLCFLNDIFDTLLKALSD 

S S D D WLL VLE VHAG VAKD P QH FRQL I VFL VHN FRADN S L LE RGAL I VRRMC VLL DAE RVYRE L S 
TILEGEDNLDFASTMVQALNLILLTSPELSKLRELLKGSLVNREGKELFVALYTSWCHSPMAIIS 
LCLLAQAYQHASWIQSLVEEDINVKFLVQLDKLIRLLETPIFTYLRLQLLEPGRYTWLLKTLYG 
LLMLLPQQSAAFKILRTRLKTVPTYSFSTGNQIGRATSGVPFSQYKHQNEDGDLEDDNINSSHQG 
INFAVRLQQFENVQNLHRGQARTRVNYSYHSSSSSTSKEVRRSEEQQQQQQOQQQQQQQQQRPPP 
SSTSSSVADNNRPPSRTSRKGPGQLQL

CCP molecule: CCP37 nucleotide sequence

cgccggaatatcctccattcga'tgctcaa'tttctgcaccggagaagaaaccga-ggaggaggagga 
agcagaagcgcggcgacggagctgagaatgacgactctttgtctttcggaagtggtgaagctgtc 
tccgctctggagaggagtctccgcctcacttttatggacgagcttatggaacgagctagaaatcg 
agatacttcaggtgtttctgaggttatctatgacatgattgctgctgggcttagccctggacctc 
gttctttccatggtttggttgtagctcacgcgcttaacggcgacgaacaaggcgcgatgcactcg 
ctgagaaaggagctaggtgcaggccaacgtccgcttcctgagactatgattgctttggttcgtct 
ctctggttcgaaagggaatgctacgagaggcctagaaatcctcgccgctatggaaaagcttaagt 
atgacattcgtcaagcttggctcattcttgttgaggagctcatgaggatcaatcacttggaagat 
gcgaataaagttttcttgaagggtgcaagaggtgggatgagagcaacagatcagctttatgattt 
gatgattgaagaagattgcaaagctggagatcattctaatgccttagacatctcttacgaaatgg 
aggcagctggtagaatggccacaacatttcatttcaactgtcttcttagtgtgcaggctacatgt 
gggattcccgaggtagcttatgctacattcgaaaatatggagtacggtgaaggtttatttatgaa 
gcctgacactgagacatataactgggtgattcaagcctacactagagccgagtcatatgataggg 
ttcaggatgttgctgaattacttggaatgatggttgaggaccacaaacgtgtgcagccaaatgtg 
aagacttatgcgctcttagttgagtgcttcaccaaatattgtgtcgtgaaggaagcgattagaca 
ttttcgtgctcttaaaaactttgaaggaggaacagtaattttacacaatgcagggaattttgagg 
atcctctctctttgtatctcagggctttgtgtcgagaaggaagaattgttgagcttattgatgct 
ttagatgcaatgcgcaaagataaccaacctatacctccaagagccatgattatgagcagaaagta 
tcgaacactagtcagctcatggattgaaccattgcaagaagaagctgaacttggctatgagattg 
attatttagcgaggtacatagaggaagggggacttactggtgaacgcaagcgttgggtacctcga 
agagggaaaactcctttagatcccgatgcttctggttttatatactcaaaccctattgaaacatc 
ctttaaacagagatgccttgaagattggaaagttcaccataggaagctcttgagaaccttacaga 
gtgaaggtcttccagttctaggagatgcatcagaatctgattacatgagagtggtggagagatta 
cggaacataataaaaggtcctgcactgaatcttttgaagccgaaagcagcaagcaagatggttgt 
atcagagttaaaggaagaactcgaagctcagggtttgccaattgatggaacaagaaatgtgcttt 
accagcgtgtccaaaaagcaaggagaataaacaaatctcgaggtcgacctctttgggttcctcca 
attgaagaagaagaggaggaggtcgatgaagaagtagacgatttaatatgtcgaatcaagctaca 
taaaaaaaacacaqaattctqgaaacatcqqtttcttqqagaagg cttgatt:gaaacttcagttg 
aatccaaggaaacgactgaatcagtggttacaggtgaatcggagaaagcgattgaagatatttca 
aaagaagctgacaatgaggaggatgatgatgaggaggaacaagaaggagatgaggatgatgatga 
aaatgaagaggaagaagtggttgttccagaaactgagaatcgagcagaaggagaagatttagtga 
agaataaggcagctgacgcgaagaagcatcttcaaatgattggagtccaactcttgaaagaatcc 
gatgaagcaaacagaacaaagaaacgtgggaagagggcatctcgtatgacacttgaggatgatgc 
agatgaggattggttccctgaggaaccatttgaagcattcaaagaaatgagggaaagaaaagtgt 
tcgatgtggctgacatgtatacaatagcagacgtttggggttggacatgggagaaggattttaag 
aacaaaactccaaggaaatggtcacaagagtgggaag'bcgagttggcaattqtgctcatgacaaa 
ggtgattgaattgggtggaattccaacgattggtgattgtqcagtgatattacgagctgctttaa 
gagctccc 
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atqccttcaqccttcttqaaqatcttqcaqacgacacacagtcttQgct actcatttqgcagccc 
qttqtacgatqaqatcatcacattgtgtttqqaccttqqaqaact tqatgcagccatcgccatag 
ttqcagatatqgaaaccacaqggatcactgticcctgatcaaaccctt qacaaggtcatatctgct 
aqacaatctaatgagagtccqcqqtctqaqcctqaaqaqccaqcatcaacaqtaaqctctBBS tt 
atcatatcctcttctqcttqttqtgaaqtctctatoagaaacaa aaatcgcFfcacraaqgagctgaa 
tctqtcttaqttatgaaaqttttqttcattataagtacaagtcatqtag ttccqagtgtagaaca 
qtttttactagtgttgcaccaggtccctccaqtctga-bacttaattc tttaqtqttqgatctttc 

tatataaqaaaaaaaaaaaaaaaaaa 



FIGURE 45 (continued) 






CCP molecule: CCP37 amino acid sequence

.MS'LIaFEN REFRSNS I HE I BRRfiJ^GX S S PRCS I SAPEKKEBRRRKQKRG tDGAENDDSLS EG S GEAVS ALERS 
^Ra^FMDEIMERA^^ 

PETMI ALVRLSGSKGNATRGLE I LAAMEKLKYDI RQAWLI LVEELMRI NHLEDANKVFLKGARGGMRATDQ 
LYDLMI EE DCKAGDH SNALDI S YEMEAAGRMATT FH FNCLLS VQATCGI PEVAYAT FENME YGEGL FMKPD 
TE T YNWVI QAYTRAE S YDRVQD VAELLGMMVE DHKRVQ PNVICT YALL VEC FT KYC VVKE AIRH FRALKNFE 
GGTVILHNAGNFEDPLSLYLRALCREGRIVELIDALDAMRKDN.QPIPPRAMIMSRKYRTLVSSWIEPL'QEE 
AELGYEIDYLARYIEEGGLTGERKRWPRRGKTPLDPD ASG 

QSE GLPVLGDASESDY^VVERLRNIIKGPALNLLKPKA ^^^^3^^^^5^S^^ ^^^BE^^^E^B^^^E 

tg^ RINKSRGRPLWPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKRRFLGEG LIETSVES 

GESEKAIEDISKEADNEEDDDEEEQEGDEDDDENEEEEVWPETENRAEGEDLVKNKAADAKKHLQMIGVQ 

LLKESDEANRTKKRGKRASRMTLEDDADEDWFPEEPFEAFKEMRERKVFDVADMYTIADVW 

KTPRKWSQEWEVELAIVLMTKVIELGGIPTIGDCAVILRAALRAPMPSAFLKILQTTHSLGYSFGSPLYDE 

IITLCLDLGELDAAIAIVADMETTGITVPDQTLDKVISARQSNESPRSEPEEPASTVSS

A.

CCP molecule: CCP38 nucleotide sequence

aagcttcgaagtcgatttcaS^a^^tcctogtcagccatcgcgaggaagacatgggagcta 

gagaa^aar-ai-.f-etcecaataaaacc aaccqattcag cctccqacacrtatattccactacgacga 

cgcttcacaaqccaaaatccaqcaqqaqaaqccatgqgcctccqa tcctaactacttcaagcgcg 

ttcacatctcaqcccttqctcttctcaaqatqqtgqttcacqctcqctccqqtqqcacaa t c gag 

atcatqqqtcttatqcaggqtaaaaccqaqqqtqat acaatcatcqttatqgatqcttttgcttt 

qcctqttgaaqqtactqaqactagggttaatqctcag tctqatqcctatqaqtatatqgttgaat 

actctcaqaccaqcaaqctqgctqqqaqqttqqaqaacqttqttgqatqgtatca ctctcaccct 

q qqtatqqatqttqqctctcqqqtattgatqtttcqa cacaqatqcttaaccaacaqtatcagga 

qccattcttagctqttqttattqatccaacaaqqactgttt cqqctqqtaaqqttqagattgggg 

cattcaqaacatatccagagqqacataaqatctcqqatqat catqtttctqaqtatcaqactatc 

cctcttaacaaqattqagqactttggtqtacattgcaaacaqtactactcattggac atcactta 

tttcaaqtcatctctcgatagtcaccttctqqatctc ctttqqaacaaqtactqgqtqaacactc 

tttcttcttccccactqttqgqcaatgqaqactatgt tqccqqqcaaatatcaqacttggctgag 

aaqctcqagcaaqcgqaqaqtcaqctcqctaactcccqqtatqqaqqaattqcgccaq ccggtca 

ccaaaqqaqqaaagaqgatqaqcctcaactcqcqaaqataactcgqqataqtgcaaa qataactg 

tcqaqcaqqtccatqqactaatqtcacagqttatcaaaq acatcttqttcaattccqctcgtcag 

tccaaqaaqtctqctqacqactcatcaqatccagaiqcccatg attaeatGglSBaqttqgtctat 

tcttttqttttttqqctgcqqaaattqactatcqqtttqacccq qtttatqaqqcaatqcccatt: 

qttccctatatctctaqtgtaqtatctqcttcaqacaaaqatcttt gqqttattaaatgacatta 

acataaatcgatcattatqtttttgcgttaaaaaaaaaaaaa aaaaaaaaaaaaaaaaaa 



B. 

CCP molecule: CCP38 amino acid sequence 

MPf^QggaTaRK'TWF.T.F.MMT'LPVEP TDSASDSlFHYDDASQAKI QQEKPWA SDPNYFKRVHISALALLKMW 
HARafiGTIEIMG LMOGKTEGDTIIVMDAFALPVEGTETRVNAQSDAYEYMVEYSQT SKLAGRLENWGWYH 
SHPGYGCWL SGIDVSTQMLNQQYQEPFLAVVIDPTRTVSAGKVEIGAFRT YPEGHKISDDHVSEYQTIPLN 
KIEDFGVHC KOYYSLDITYFKSSLDSHLLDLLWNKYWVNTLSSSPLLG NGDYVAGQISDLAEKLEQAESQL 
ANSRYGGIAPAGHQRRKEDEPOLAKITRDSAKITVEQVHGLMSOVIKDIL FNSARQSKKSADDSSDPEPMI 

TS

description of molecule                      amino acid     nucleic acid
                                             sequence       sequence
                                             SEQ ID NO:     SEQ ID NO:

Tag·100 epitope                              199
c-myc epitope                                200
FLAG-epitope                                 201
HA epitope                                   202
protein C epitope                            203
VSV epitope                                  204
DP conserved DNA binding domain              240
DP conserved heterodimerization domain       241
DP conserved [illegible] domain              242
primer A                                                    243
primer B                                                    244
primer [illegible]                                          245
attB1 site                                                  246
Kozak consensus                                             247
attB2 site                                                  248
sense E2Fa primer                                           249
antisense E2Fa primer                                       250
sense DPa primer                                            251
antisense DPa primer                                        252
sense CDKA primer                                           253
antisense CDKA primer                                       254
sense CDKB primer                                           255
antisense CDKB primer                                       256
sense histone H4 primer                                     257
antisense histone H4 primer                                 258
sense roc5 primer                                           259
antisense roc5 primer                                       260
sense actin primer                                          261
antisense actin primer                                      262
CDK phosphorylation motif CDC2bDN-IC26M      263
ICK4                                         264
forward sequencing primer prm1024                           265
reverse sequencing primer prm1025                           266
cyclin destruction box                       267
cyclin box consensus motif 1                 268
cyclin box consensus motif 2                 269
CDC2 consensus motif 1                       270
CDC2 consensus motif 2                       271
CDC2 consensus motif 3
CDK phosphorylation site consensus 1         273
CDK phosphorylation site consensus 2         274
CDK phosphorylation site consensus 3         275
CDK phosphorylation site consensus 4         276
NLS consensus 1                              277
NLS consensus 2                              278
NLS consensus 3                              279
NLS consensus 4                              280
Cy-like box consensus                        281
Rb binding domain consensus 1                282
Rb binding domain consensus 2                283
Rb binding domain consensus 3                284
Rb binding domain consensus 4                285
DEF domain                                   286
DNA binding domain                           287
DCB1 domain consensus 1                      288
DCB1 domain consensus 2                      289
DCB2 domain                                  290

SEQUENCE LISTING

<110> CROPDESIGN N.V. 

<120> NUCLEIC ACID MOLECULES ENCODING PLANT CELL CYCLE 
PROTEINS AND USES THEREFOR 

<130> CNN-001PC 

<150> US 60/204,045 
<151> 2000-05-12 

<160> 290 

<170> PatentIn version 3.0

<210> 1
<211> 1255
<212> DNA
<213> Arabidopsis thaliana

<400> 1
ccacatatcc gtgatgagga aactaagaaa ccagactcag tttcaagtga agaaccagag  60
acgattatca ttgatgtgga tgaaagtgat aaagaaggag gtgactctaa tgagccaatg  120
tttgtacaac atactgaagc aatgctggag gagattgaac agatggagaa ggagattgaa  180
atggaagatg cagacaaaga agaagagcct gtgatcgata ttgatgcctg tgataagaat  240
aatcctttgg ctgcggttga atatatccat gatatgcata ccttctacaa gaattttgag  300
aaacttagtt gcgtgcctcc taactatatg gacaatcaac aagatcttaa tgagagaatg  360
agaggaatcc tcattgactg gttaattgag gtgcactaca agtttgaact gatggaggaa  420
actctttatc tcacaatcaa tgtcatcgac agattccttg cggttcatca aatcgtgagg  480
aaaaagcttc agcttgttgg tgttactgct ttgttgcttg catgtaaata tgaagaagtt  540
tcagttccag tggtagatga tctcatcttg atctctgaca aagcttactc tagaagagaa  600
gtgctagata tggagaagct aatggccaac accttgcaat tcaatttctc tctaccaact  660
ccatatgttt tcatgaaacg atttctcaaa gctgcccaat ctgacaagaa gcttgagatt  720
ttatcattct ttatgatcga gctttgcctt gtggagtatg agatgctaga gtatcttcca  780
tctaagctgg cggcctcagc aatctacact gctcagtgta cacttaaggg atttgaagaa  840
tggagcaaaa cctgtgagtt tcacacaggc tacaacgaaa aacagctact ggcatgtgcg  900
agaaagatgg ttgctttcca tcacaaggca ggaacaggga agctcacagg agttcacaga  960
aagtacaaca catctaagtt ctgtcatgct gcaagaactg aaccagctgg gtttctgatt  1020
taatattaat aagaatctaa tatgacttaa ctcgagtttt tctttagaac aaaaagagtg  1080
tgagagaaag agagatagta gagcaagttg cccaaaatgg gagaagaatg gatctttaga  1140
tatcatggca agtagcccaa aaagagtgta ttcttctctt tctaaggtct ttagatcttt  1200
cttcacttga gagagaataa aaagaatctt ctgaaaaaaa aaaaaaaaaa aaaaa  1255

<210> 2
<211> 471
<212> DNA
<213> Arabidopsis thaliana

<400> 2
cccgattcgg gtactgctgc tggtgggtca aactccgacc cgtttcctgc gaatcttcga  60
gttcttgtcg ttgatgatga tccaacttgt ctcatgatct tagagaggat gcttatgact  120
tgtctctaca gagtaactaa atgtaacaga gcagagagcg cattgtctct gcttcggaag  180
aacaagaatg gttttgatat tgtcattagt gatgttcata tgcctgacat ggatggtttc  240
aagctccttg aacacgttgg tttagagatg gatttacctg ttatcatgat gtctgcggat  300
gattcgaaga gcgttgtgtt gaaaggagtg actcacggtg cagttgatta cctcatcaaa  360
ccggtacgta ttgaggcttt gaagaatata tggcaacatg tggtgcggaa gaagcgtaac  420
cgagtggaat ggttctgaac attctggagg aagtattgaa gatactggcg g  471



<210> 3
<211> 1351
<212> DNA
<213> Arabidopsis thaliana

<400> 3
atggggaagg aaaatgctgt gtctcggcca ttcactcgtt cccttgcctc tgctttgcgc  60
gcttcagaag tgacttctac tacacagaat caacagagag taaacacaaa aagaccagcc  120
ttggaggata caagagccac tggacccaac aagaggaaga agcgagcggt tctaggggag  180
atcacaaatg ttaactccaa tacagctata cttgaggcca aaaacagcaa gcagataaag  240
aaaggacgcg gtcatggatt ggcgagtaca tcccagttgg caacttctgt tacttcagaa  300
gtcacagatc ttcagtccag gaccgatgca aaagttgaag ttgcatcaaa tacagcagga  360
aacctttctg tttctaaagg cacagataac acagctgata actgtattga gatatggaat  420
tctagattgc ctccaagacc tcttgggaga tcagcttcta cagctgagaa aagtgctgtt  480
attggtagtt caactgtacc ggatatccca aaatttgtag acatcgattc agatgacaag  540
gatcctttac tgtgctgcct ctatgcccct gaaatccact acaatttgcg tgtttcagag  600
cttaaacgca gaccacttcc ggactttatg gagagaatac agaaggatgt cacccagtcc  660
atgcggggaa ttctggttga ttggcttgtg gaggtctctg aagaatacac acttgcatct  720
gacactctct acctcacagt gtatctcata gactggttcc tccatggaaa ctacgtgcaa  780
agacagcaac ttcaactgct cggcatcact tgcatgctaa ttgcctcgaa gtatgaggaa  840
atctctgctc cacgcattga ggagttttgc ttcattacgg ataacaccta cacaagagat  900
caggtcctgg aaatggagaa ccaagtactt aagcatttta gctttcaaat atacactccc  960
actccaaaaa cgttccttag gagatttctc agagcagctc aagcctctcg cctgagccca  1020
agccttgaag tcgagtttct agccagctat ctaacagagt tgacattaat agactaccat  1080
ttcttaaagt ttcttccttc cgttgttgct gcttcagcgg gttttctcgc caagtggaca  1140
atggaccaat caaaccaccc atggaatcca acacttgagc attacacaac gtacaaagca  1200
tcggatctga aagcatctgt tcatgcctta caagatctgc agcttaacac caaaggttgc  1260
cccttgagcg ctatacgcat gaagtatagg caagagaaat acaaatctgt ggcggttctc  1320
acgtctccaa agctacttga cacgctattc t  1351



<210> 4
<211> 672
<212> DNA
<213> Arabidopsis thaliana

<400> 4
atggggaaga agtgtgattt atgtaacggt gttgcaagaa tgtattgcga gtcagatcaa  60
gctagtttat gttgggattg cgacggtaaa gttcacggcg ctaatttctt ggtagctaaa  120
cacacgcgtt gtcttctctg tagcgcttgt cagtctctta cgccgtggaa agctactggg  180
cttcgtcttg gcccaacttt ctccgtctgc gagtcatgcg tcgctcttaa aaacgccggc  240
ggtggccgtg gaaacagagt tttatcggag aatcgtggtc aggaggaggt taatagtttc  300
gagtccgaag aagatcggat tagagaagat cacggtgacg gtgacgacgc ggagtcttac  360
gatgatgatg aggaagaaga tgaggatgaa gagtacagcg acgatgagga tgaggatgat  420
gatgaggatg gtgatgatga ggaagcggag aatcaagttg tgccgtggtc tgcggcggcg  480
caagttcctc cggtgatgag ttcttcatct tctgacggag gaagcggagg ttcagtgacg  540
aagaggacga gggctagaga gaattcagat cttctctgct ccgatgatga gatcggaagc  600
tcttcagctc aagggtcaaa ctattctcgg ccgttgaagc gatcggcgtt taaatcaacg  660
gttgttgttt aa  672



<210> 5
<211> 1287
<212> DNA
<213> Arabidopsis thaliana

<400> 5
atggttaact catgcgagaa caaaatcttc gttaaaccca cttcaacgac gattcttcaa  60
gatgaaacaa gaagtagaaa attcggacaa gagatgaaga gggagaagag aagagtgttg  120
cgtgtgatta accagaatct cgctggtgga agagtttatc cttgtgttgt caacaagaaa  180
ggaagcttat tgtctaataa gcaagaagaa gaagaaggat gtcaaaagaa gaagtttgat  240
tctttgcgtc cttcagttac aagatctgga gttgaggaag agactaacaa gaagctgaag  300
ccctcagttc caagtgctaa cgacttcggt gattgtatat ttattgatga ggaggaagct  360
acattggacc ttccaatgcc aatgtcgctt gagaaaccat acattgaagc tgatccaatg  420
gaagaagttg agatggagga tgtaacagtg gaagaaccga tcgtggatat cgatgtctta  480
gactcgaaga actcgcttgc ggctgttgaa tatgttcaag atctttacgc attttacaga  540
acaatggaga gatttagttg tgttccagta gactatatga tgcaacaaat cgacttaaac  600
gagaagatga gagcaatact aatcgactgg ttaatcgagg tacatgacaa gtttgatctg  660
atgaacgaga cactgtttct gacagtgaat ctgatagata gattcttgtc caagcaaaat  720
gttatgagaa agaagcttca gcttgtaggg ttagtagctt tgctgttagc ttgtaagtat  780
gaggaggttt cggttcctgt tgtcgaagat ttagtactca tttcggacaa agcgtatacg  840
aggaacgatg ttctagagat ggagaaaaca atgttgagta ctttgcaatt caatatctcg  900
ttaccgacac aatacccgtt cttgaaaaga ttcctcaagg cagctcaagc agacaagaag  960
tgtgaggtct tggcgtcgtt cttgatcgag cttgcccttg tggagtacga gatgcttcgg  1020
tttccaccat cattactagc tgccacatct gtgtacactg ctcaatgtac acttgatggt  1080
tccaggaaat ggaacagtac atgtgaattc cattgtcatt actctgaaga ccagctcatg  1140
gaatgttcac ggaagctggt gagtctgcat cagagggcgg cgacaggaaa cttaacagga  1200
gtatatagga agtacagcac aagcaaattt ggttacatag caaaatgtga agctgcacac  1260
tttctagtgt ctgagtctca tcattct  1287



<210> 6
<211> 1078
<212> DNA
<213> Arabidopsis thaliana

<400> 6
actaagcagg aggccaaagc tgctttcaag tctcttttgg aatctgtaaa tgttcattcc  60
gactggacat gggaacagac attgaaagag attgttcacg ataaaagata tggtgctttg  120
aggacactcg gcgagcggaa acaagcgttt aacgagtatc ttggccaaag gaaaaaagtg  180
gaagctgagg aaagacgaag gaggcagaag aaagctcggg aagaatttgt caagatgcta  240
gaggagtgtg aagaactttc atcatccctg aaatggagca aagcaatgag tttgttcgaa  300
aatgatcagc gttttaaagc tgttgaccgt cctagggatc gtgaagatct ttttgacaat  360
tacattgtgg aacttgagag gaaggaaaga gaaaaggcag cggaggaaca tcggcagtat  420
atggcagact atcggaagtt tcttgaaacc tgtgactata tcaaagctgg tacacaatgg  480
cgcaaaattc aagatagact ggaggatgat gacagatgct catgtcttga aaagatagat  540
cgtctgattg gttttgagga atacattctt gacctagaga aggaagaaga agagctgaag  600
agagtagaga aagaacatgt aaggcgggcc gagagaaaaa accgtgatgc atttcgtaca  660
ctattggaag aacatgttgc tgcaggcatc cttacagcca agacgtactg gttggattat  720
tgcattgagt taaaagactt gccccaatac caagctgttg catctaatac atctggttca  780
actccgaaag acttgtttga agatgtcaca gaagaattag agaagcagta tcatgaggat  840
aagagctatg tgaaggatgc tatgaagtca agaaagattt ccatggtctc ctcgtggctg  900
tttgaagatt ttaaatctgc tatttcagaa gatctcagta ctcaacagat atcagacata  960
aatttaaagc ttatatatga tgacttggtt gggagagtga aggaaaaaga agaaaaagag  1020
gccagaaagc ttcagcgtct ggctgaagaa tttaccaatc tgttgcacac tttcaagg  1078



<210> 7
<211> 511
<212> DNA
<213> Arabidopsis thaliana

<400> 7
caagagaaac cgtgggagaa tgatcctcac tactttaaac gagtcaagat ctcagcgctc  60
gctcttctta agatggtggt tcacgctcgc tctggtggta caattgaaat aatgggtctt  120
atgcaaggta agaccgatgg tgatactatc attgttatgg atgcttttgc tttaccagtg  180
gaaggtactg agacaagggt taatgctcag gatgatgctt atgagtacat ggttgagtat  240
tcacagacca acaagctcgc ggggccggct ggagaatgtt gttggatggt atcactctca  300
ccctggatat ggatgctggc tctccggtat tgatgtttct acgcagaggc ttaaccaaca  360
gcatcaggag ccatttttag ctgttgttat tgatcccaca aggactgttt cagctggtaa  420
ggttgagatt ggtgctttca gaacatactc taaaggatat aaagccctcc agatgaacct  480
gtttctgagt atcaaaacta ttcctttaaa t  511



<210> 8
<211> 1155
<212> DNA
<213> Arabidopsis thaliana

<400> 8
agtagactca cctgattcaa cctccgacaa catcttctac tacgacgata cttcacagac  60
taggttccag caagagaaac cgtgggagaa tgatcctcac tactttaaac gagtcaagat  120
ctcagcgctc gctcttctta agatggtggt tcacgctcgc tctggtggta caattgaaat  180
aatgggtctt atgcaaggta agaccgatgg tgatactatc attgttatgg atgcttttgc  240
tttaccagtg gaaggtactg agacaagggt taatgctcag gatgatgctt atgagtacat  300
ggttgagtat tcacagacca acaagctcgc ggggcggctg gagaatgttg ttggatggta  360
tcactctcac cctggatatg gatgctggct ctccggtatt gatgtttcta cgcagaggct  420
taaccaacag catcaggagc catttttagc tgttgttatt gatcccacaa ggactgtttc  480
agctggtaag gttgagattg gtgctttcag aacatactct aaaggatata agcctccaga  540
tgaacctgtt tctgagtatc aaactattcc tttaaataag attgaggact ttggtgttca  600
ctgcaaacag tactattcat tagatgtcac ttatttcaag tcatctcttg attctcacct  660
tctggatcta ctatggaaca agtactgggt gaacactctt tcttcttctc cactgctggg  720
taatggagac tatgttgctg gacaaatatc agacttagct gagaagcttg agcaagccga  780
gagtcatctg gttcagtctc gctttggagg agttgtgcca tcatcccttc ataagaaaaa  840
agaggatgag tctcaactaa ctaagataac tcgggatagc gcaaagataa ctgtggaaca  900
ggtccatgga ctaatgtcgc aggtcataaa agatgaatta ttcaactcaa tgcgtcagtc  960
caacaacaaa tctcccactg actcgtcgga tccagaccct atgattacat attgaagttg  1020
ctcttctttt ggtttctagt tttggattga cccatcattt gttgtccttt catttatttt  1080
ctgttgtgta aagaattata atgctaatca gaataataca gaagaagatt ttggttaaaa  1140
aaaaaaaaaa aaaaa  1155



<210> 9
<211> 1308
<212> DNA
<213> Arabidopsis thaliana

<400> 9
atgtattgct cttcttcgat gcatccaaat gcaaacaaag aaaatatctc tacttcagat  60
gtacaggaga gttttgtacg aataacgaga tcacgagcta aaaaagccat gggaagagga  120
gtatcaatac ctccaacaaa accttctttt aaacagcaaa agagacgtgc agtacttaag  180
gatgtgagta atacctctgc agatattatt tattcagaac ttcgaaaggg aggcaacatc  240
aaggcaaaca gaaaatgtct aaaagagcct aaaaaagcag caaaggaagg tgctaacagt  300
gccatggata ttctggtaga tatgcataca gaaaaatcaa aattagcaga agatttgtcc  360
aagatcagga tggctgaagc ccaagatgtc tctctttcaa actttaaaga tgaagaaatt  420
actgagcaac aagaagatgg atcaggtgtc atggagttac ttcaagttgt agatattgat  480
tccaacgtcg aagatccaca gtgttgcagc ttgtatgctg ctgatatata tgacaacata  540
catgttgcag agcttcaaca acgacccttg gctaattata tggagcttgt gcagcgagat  600
atcgacccag acatgagaaa gattctgatt gactggcttg tagaagtttc tgacgactac  660
aagctggttc cagatacgct ttaccttaca gtgaatctta tcgaccggtt tctgtccaac  720
agttacattg aaaggcaaag actccagctc cttggtgtct cttgcatgct tatagcttca  780
aaatatgaag agctttccgc accaggggtg gaggagtttt gcttcattac ggccaacaca  840
tacacaagac gagaagtgct gagcatggag attcaaattc taaattttgt gcactttaga  900
ttatcggttc ctaccaccaa aacatttctg aggcggttca ttaaagcagc tcaagcttcg  960
tacaaggtgc ctttcattga actggagtat ttagcaaact atctcgccga attgacactg  1020
gtggaatata gtttcctaag gttcctgcca tcactaattg ctgcttcagc tgttttccta  1080
gcccgatgga cactcgacca aactgaccat ccttggaacc ctactctgca acactacacc  1140
agatatgagg tagctgagct gaagaacaca gttctcgcca tggaggactt gcagctcaac  1200
accagtggct gtactctcgc tgccacccgt gagaaataca accaaccaaa gtttaagagc  1260
gtggcaaagc tgacatctcc caaacgagtc acattactat tctcaaga  1308



<210> 10
<211> 1006
<212> DNA
<213> Arabidopsis thaliana

<400> 10
agacttcaca ttttaccatt atttgctctg agctcagtag gagagttcaa gaaacaatgg  60
caaagatgca attatcaatc tttatcgctg tcgttgcgct tatcgtctgc tctgcatctg  120
ctaaaaccgc aagccctcca gctccagtgc tgccaccgac accagctcca gcaccagccc  180
cggaaaatgt gaatctcacc gagcttttaa gtgtagctgg tccgttccac acattcctcg  240
actaccttct ctcgactgga gtcattgaga ctttccaaaa ccaagctaac aacactgagg  300
aaggcatcac aatctttgtc cctaaagatg atgctttcaa agctcagaag aatcctcctt  360
tgtcaaatct cacaaaggat cagcttaagc agcttgttct cttccatgct ctgcctcatt  420
actattcgct ttcggaattc aagaacttga gccaatctgg tccagtgagc acctttgctg  480
gtggtcaata ctccttgaaa ttcactgatg tttctggcac ggttaggatt gattctttat  540
ggaccaggac taaagtcagc agcagtgttt tctccactga ccctgttgcg gtttaccaag  600
tgaaccgcgt gcttctaccc gaagcaatct ttggtactga tgtccctcca atgcctgctc  660
cagctcctgc tcctatcgtt agtgctcctt cggattctcc ttcagttgct gattctgaag  720
gagcttcttc accaaagtcc tcacacaaga actccggaca aaagctgcta cttgcaccaa  780
tctccatggt tatttccggt ttggtggcat tgttcttgtg atcagatggt tttgcagatt  840
gagttatgtt tttaagttac aatgtgaaag attgtattac atcatttgaa ttgtcttttt  900
gatttttgaa acccattttt tattatacat ttttatcatt attattgttt gtcattacga  960
ttgttgtgaa ttgaaattgt tcctccaaaa aaaaaaaaaa aaaaaa  1006



<210> 11
<211> 643
<212> DNA
<213> Arabidopsis thaliana

<400> 11
atttatcatt acagtctgat ttgagctaag ttctctcatc ataaactctc cttggagaat  60
catggctatt tcaaaagctc ttatcgcttc tcttctcata tctcttcttg ttctccaact  120
cgtccaggct gatgtcgaaa actcacagaa gaaaaatggt tacgcaaaga agatcgattg  180
tgggagtgcg tgtgtagcac ggtgcaggct ttcgaggagg ccgaggctgt gtcacagagc  240
gtgcgggact tgctgctaca ggtgcaactg tgtgcctccg ggtacgtacg gaaactacga  300
caagtgccag tgctacgcta gcctcaccac ccacggtgga cgccgcaagt gcccataaga  360
agaaacaaag ctcttaattg ctgcggataa tgggacgatg tcgttttgtt agtatttact  420
ttggcgtata tatgtggatc gaataataaa cgagaacgta cgttgtcgtt gtgagtgtga  480
gtactgtatt attaatggtt ctatttgttt ttacttgcaa gttttcttgt tttgaatttg  540
tttttttcat atttgtatat cgattcgtgc attattgtat tatttcaatt tgtaataaga  600
ttatgttacc tttgagtggt tgtttaaaaa aaaaaaaaaa aaa  643

<210> 12
<211> 484
<212> DNA
<213> Arabidopsis thaliana

<400> 12
aaggaagaag caggaatgta ttggggatac aaagtacgat atgcatcaca attaagttca  60
gtattcaagg aatgcccttt cgagggtggt tacgattatt tgattggtac ctcggagcac  120
ggcctggtaa ttagttcatc tgagctgaaa ataccaacat ttaggcacct attgattgca  180
tttggtggac ttgctgggct tgaagaaagt attgaagatg ataatcagta taaggggaaa  240
aacgttcgag atgtgtttaa tgtatacttg aatacttgtc cacatcaagg tagccgaacc  300
attcgagcag aggaagcgat gtttatatca cttcagtact tccaggaacc catcagcagg  360
gcagtgagaa gactttaagc ttcgataaaa agagtcaaag aagctatttt gttctcatag  420
atctgaggtt tgtctgaaaa agagtgatgt aatgtaactg ttttagaaaa aaaaaaaaaa  480
aaaa  484



<210> 13
<211> 688
<212> DNA
<213> Arabidopsis thaliana

<400> 13
agatggggaa gaagaacaag agaagtcaag acgagtctga gctcgaattg gagccagagc  60
taacgaaaat aatcgatgga gactctaaaa agaagaaaaa taagaataag aagaagagaa  120
gccatgaaga tacggagata gaaccggagc aaaagatgag tctcgacgga gactcgaggg  180
aggagaagat aaagaagaag aggaagaaca agaaccaaga ggaggagcca gagcttgtga  240
cggagaaaac gaaagtccaa gaggaggaaa agggaaatgt agaagagggt agagccactg  300
ttagcatagc catagctggt tcaatcatcc acaacactca atcacttgag ctcgccacac  360
gcgtaatctc tctttctctc tatctctccc ttcgtttctc tgtttttcca ttcccagata  420
atttaaagtc cccttcttcc atttctaaca tttctcagct cgccggccaa attgctcgtg  480
cagctacaat tttccgaatc gacgagatcg tagtgttcga caataagagc agctcagaaa  540
tcgaatcagc tgctacgaat gcttctgata gcaatgaaag tggtgcctcc tttctcgttc  600
gtatcttgaa gtatctagag acaccacaat atttgaggaa atctctcttc cccaagcaaa  660
atgatcttag atatgtgggt atgttgcc  688



<210> 14 

<211> 461 

<212> DNA 

<213> Arabidopsis thaliana 



<220> 

<221> misc_feature
<222> 396 

<223> n = a,g,c, or t 



<400> 14 

gtcagtgctg 

ttgatgatcg 

gcaatgctga 

tactcatccg 

gtatattaca 

gtgaagcctg 

acaacatttt 



tctggcatgg 
atggttcgaa 
gaaatgtttt 
tcggtttcat 
ttggaacagt 
ttagaccaaa 
ggttcttttc 



actgtatcct 
agctatttac 
ggttctcatc 
ggttttaagc 
tatacctatc 
gaccagaaaa 
tttttttcca 



ggatacatta 
cggtggcaac 
aatttcctct 
ttgcacgaaa 
gctgtgcttc 
gaagaataat 
cttggnccgt 



tattctttgt 
aagcaatacc 
acacagtagt 
cactagtcgc 
ttctcagcta 
gttgtctttt 
tttatgtaaa 



gcaatcagca 
tccgaaaatg 
ggttctcaat 
cttcaagagt 
cttagttcct 
taaaaaatca 
acaagagaaa 



60 
120 
180 
240 
300 
360 
420 
tcaagatttg aggttttatt cttaaaaaaa aaaaaaaaaa a 461

<210> 15 

<211> 862 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> misc_feature

<222> ,292,294,339 

<223> n = a, g, c, or t 



<400> 15

ggtttttgaa tacatggaca ctgatgtcaa gaaattcatc agaagtttcc gtagcactgg 60

caagaacatt ccaacccaaa ctatcaagag cttgatgtat caactatgca aaggtatggc 120 

attctgccat ggtcacggga tattgcacag agatctcaag cctcacaatc tcttgatgga 180 

tcccaagaca atgaggctca aaatagcaga tcttggttta gccagagcct tcactctgcc 240 

aatgaagaag tatacccatg agatattaac tctttggtat agagctccag angnttcttc 300 

ttggtgccac ccattactct acagctgtgg atatgtggnc tgttggctgc atatttgctg 360 

aacttgtgac caaccaagca atctttcagg gagactctga gctccaacag ctcctccata 420 

ttttcaagtt gttgggacac ccaatgaaga aatgtg^cca ggagtgagca cactcaagaa 480

ctggcatgaa tacccacagt ggaaaccatc gactctatct ctgctgttcc aaacctcgac 540

gaggctggag ttgatcttct atctaaaatg ctgcagtacg agccagcgaa acgaatatca 600 

gcaaagatgg ctatggagca tccttacttt gatgatctgc cagaaaagtc ctctctctaa 660 

ggatttaaaa tcttcagtta gtatctttcc aagttttatg gtttttctag ttttgcttct 720 

ttcaagcata tctctagtgt gctgcttccc cctctatgaa tcatcctttc tttagcataa 780 

tatatcactt ctgattgttg tttctttcta ttcgaatatt tggattaacg gctttaatgt 840 

tcttaaaaaa aaaaaaaaaa aa 862

<210> 16 
<211> 1114 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 16 

acccaaaaga aggatgagta tggagatgga gttgtttgtc actccagaga agcagaggca 60 

acatccttca gtgagcgttg agaaaactcc agtgagaagg aaattgattg ttgatgatga 120 

ttctgaaatt ggatcagaga agaaagggca atcaagaact tctggaggcg ggcttcgtca 180 

attcagtgtt atggtttgtc agaagttgga agccaagaag ataactactt acaaggaggt 240 

tgcagacgaa attatttcag attttgccac aattaagcaa aacgcagaga agcctttgaa 300 

tgaaaatgag tacaatgaga agaacataag gcggagagtc tacgatgcgc tcaatgtgtt 360 

catggcgttg gatattattg caagggataa aaaggaaatc cggtggaaag gacttcctat 420 

tacctgcaaa aaggatgtgg aagaagtcaa gatggatcgt aataaagtta tgagcagtgt 480

gcaaaagaag gctgcttttc ttaaagagtt gagagaaaag gtctcaagtc ttgagagtct 540 

tatgtcgaga aatcaagaga tggttgtgaa gactcaaggc ccagcagaag gatttacctt 600 

accattcatt ctacttgaga caaaccctca cgcagtagtc gaaatcgaga tttctgaaga 660 

tatgcaactt gtacacctcg acttcaatag cacacctttc tcggtccatg atgatgctta 720 

cattttgaaa ctgatgcaag aacagaagca ggaacagaac agagtatctt cttcttcatc 780 

tacacatcac caatctcaac atagctccgc tcattcttca tccagttctt gcattgcttc 840

tggaacctca ggcccggttt gctggaactc gggatccatt gatactcgct gaccgagctt 900

ctattcccaa attcttcaag aagaagaagt aatgatctaa ttggtatact aaaaaattat 960



acatctggtt tagtgttcaa ttgagagaga ctgtaaaatc aattcatagg ccaacaaatg 1020 
tttgtttatc caattttcct ttttattcga acttgatgcg atatttcaac ggaaacagaa 1080
actattgttt taaaccaaaa aaaaaaaaaa aaaa 1114 
<210> 17

<211> 794 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 17 

aagatgcaac 

gatcagttat 

tcgatgaaag 

gagatcagag 

gctatgattc 

gtgaggaaga 

aaaaaggaaa 

gtggagctga 

aagctggagg 

gtcgctggga 

ttcttttatg 

ttgaactaga 

gattggcaga 

aaaaaaaaaa 



cgacagagac 
cggagagtat 
ctttgtcggc 
aaagagttca 
gcgaggaact 
agattgattc 
cagagtacaa 
tcaccaagct 
agctaagcaa 
tttggctatt 
attctagtaa 
tgttgcaaag 
attgtaatag 
aaaa 



gtcgcagccg 
gagcttcagt 
gttcaaggcc 
agctcagctt 
tgaaggtttt 
tctcgacaaa 
ggatgctctt 
acaggagttg 
gaacattgat 
caaagttcta 
tatatataat 
aaaatgtaac 
taatgtaaag 



gcgccgtcgg 
agccaaatga 
aaagaagagg 
ggtcgtgttg 
gctgatccca 
gaattaaagc 
gaagcattca 
gagggagaaa 
ctaaccaaac 
aaaaagtcac 
ttataaaata 
aaagttacga 
aaactatgtt 



atcaaggccg 
agaaggaaga 
agatcgagaa 
aagatgagtc 
tgaggaagga 
cattggggaa 
atgaaaagaa 
gcgagaaatt 
cttagtgttg 
tttttagagt 
aaaagtaaga 
tggcactaca 
tgttccggaa 



ccggcttaag 
cgatgagttg 
gaagaagatg 
caagcgtctc 
agttactatg 
tacagttcag 
caaggagaag 
caggttcaag 
gacgagcaga 
attttcattg 
agatatgtgt 
ttatcgacgt 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
794 



<210> 18 

<211> 448 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 18 

cagaaacaag 

gatgatgtcc 

cccaaagctg 

cccatcataa 

ctaatgcttg 

agcaattttc 

ctactatgtc 

aaaaaaaaaa 



ctccaggtgc 
cagatcttgt 
ctgcttctta 
aaatgtttgt 
cgaacagaaa 
cttttattaa 
ctgtcgcttc 
aaaaaaaaaa 



aggtgatgtc 
agtgggagag 
gaggaggagg 
cgctcgacct 
gacttggttt 
ggttagatcg 
atacacttct 
aaaaaaaa 



ccagcaacaa 


tccaagaaga 


ggacgatgat 


60 


actttcgaga 


cccctgctac 


tgaagaggct 


120 


aagaagaagg 


agaagagctc 


acctgcaaaa 


180 


cttctgagca 


ctgtcagatt 


cttgttttct 


240 


tattatcact 


tgatgctttt 


tggtccgaac 


300 


ctttttgttt 


accacctgtt 


caaatgagta 


360 


tgcaacacag 


tcctttgttt 


tgagtcaaaa 


420 








448 



<210> 19 

<211> 1152 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 19 

atggaggacg 

caagctcctc 

aaaccacaac 

catactcctt 

ggaggaagcg 

gttctaatcg 

caacattgga 

actgatattc 

gctaagattg 

ttgattggtg 

ggaggagctc 

attgtgcagc 



acgacgagat 
cttctccgcc 
cggggctttc 
ctgtaaccgg 
gtggtggtgg 
aagcttgggg 
aagaagtagc 
agtgtaagaa 
cttctggtga 
gtactacaac 
ttgggaatag 
aacaacaaga 



tcagtcaatt 
gattttgcca 
ttctcaatct 
tggtggtggt 
aggaggaaga 
agatcgattc 
tgagattgtg 
cagaattgat 
tggacctagt 
attcattgct 
ccgttcgagt 
gaagagaggc 



ccatctccgg 
acaaacgacg 
ccgtccatga 
agcggaaaca 
gatgattgtt 
tctgaaccag 
aacaagagtc 
acggtgaaga 
aaatgggttt 
tcttcaaaag 
atgtttaaac 
tctgattcga 



gagattcttc 
tgacggtggc 
acgctttagc 
gaaacggacg 
ggagcgaaga 
gtaaaggaac 
gtcaatgcaa 
agaagtataa 
tcttcaagaa 
cttcagagaa 
ggcaaactaa 
tgcggtggca 



cctttcacca 
cgtcgtgaag 
gttagtggtt 
aggaggagga 
agctacaaag 
tttgaagcaa 
ataccctaaa 
gcaagagaaa 
gcttgagagt 
ggctcctatg 
aggtaatcag 
ttttaggaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
cgtagtgctt
tctgctgaga 
cggttgaagg 
atacttggat 
gaactggaaa 
ttgaaaactc 
cgaggagaaa 
gtaagtagct 



ctgagactga 
gtctcccacc 
tggataagag 
ttacggaagc 
aggagaggat 
aattggagat 
ggaggatcgt 
ga 



gtctgagtct 
tttgcaaccg 
tggaggtgga 
ttatgagaag 
gaaatttgct 
aacacagaac 
tgatgatgat 



gatcctgaac 
attcaaccgc 
gggagtggag 
gcggaaactg 
aaagagatgg 
aatcaagaag 
gatgatcgca 



ctgaggcttc 
tttcgtttca 
ttggagatgt 
ctaagcttaa 
agttgcagag 
aggaagagag 
atggcaagaa 



tcctgaggaa 
tatgccaaag 
ggcgagggcg 
gttaatggcg 
aatgcagttc 
gagcaggcag 
taacggcaat 



780 
840 
900 
960 
1020
1080
1140
1152 



<210> 20 

<211> 409 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> misc_feature

<222> 201, 344, 365, 369

<223> n = a,g,c, or t



<400> 20 

cctccttctc 

ttcttccatg 

cacgtctgcc 

cgactctctc 

taatcaatcg 

gctttatctt 

ttacngatnc 



cacgcttctt 
gccaccgtat 
tcagattccg 
gattccttct 
cctctttctc 
tcctttaacc 
ttaattgcga 



cttcttcttc 
cttcttcctc 
attctacttt 
nctccatgag 
cccctacgcc 
aagatcatgc 
tccctttcgc 



ctcaatctct 
ctggccaaac 
tccctctcac 
tcttaactcc 
caatttaccg 
ttgcttcgcc 
gagattttcc 



cttacgattc 
cccaacccta 
cgcgatcgcg 
gacgaaccta 
gtgatgcctc 
tgtnggcact 
ggcgggatt 



cttcaaatca 
atcccgattc 
tagacgaacc 
atcagacttc 
ctccgttcgt 
gaccgtggct 



60 
120 
180 
240 
300 
360 
409 



<210> 21 

<211> 758 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 21 

gtcaggctca 

ccactgctag 

gtcaagaggt 

atgctcagtg 

tcaactccgg 

ccccatcgtc 

ggtcggtggc 

aaaagaacac 

cggtgaacgg 

ttttctaaaa 

caacttaagc 

ataatttgta 

attactgcag 



tgattccaga 
ctctaagggt 
aaggaggggt 
gttagctgtc 
atctcaagtg 
gtctctgtct 
tcagttcagg 
cgttgttatt 
cggtgaaatg 
gctttactac 
ggtgagatat 
aaacctttta 
agatctatga 



atagcttgct 
actctggttc 
gcggatagag 
tcaagtgaca 
aaagactcat 
ttattcaaag 
ttggttgaag 
cttggcatgg 
tctcagcttg 
ttatactctt 
ggtgtatagt 
tcgctacccg 
atcgtagata 



tcgctctcac 
ggatcttcaa 
cagagatcta 
aaggaacggt 
cccgaattgc 
gagtgttacc 
gaactcagta 
atgggagctt 
agtaccacaa 
ttgttccttc 
tttgtgtata 
ggtcgactct 
ctgaaaaa 



gcaggatggc 
tactgttgat 
cagtttggcc 
ccatgtcttt 
acctgatgct 
gaggtatttc 
catagccgcc 
ctacagatgc 
ctgtctgaaa 
tctctcttta 
taataatgat 
agagccctat 



catttgttgg 
ggtaccttgc 
ttctcttcaa 
ggtctcaaag 
actccctcat 
agctcggagt 
tttggccatc 
cagtttgatc 
ccgccttcag 
tatctctctg 
gggtcgtcct 
agtgagtcgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
758 



<210> 22 

<211> 624 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 22 
atggactttt
tttttcgaag 
tactttgatg 
atgatagtct 
aaggcaatgc 
catcttcctg 
gcttcctctc 
agaaagcttt 
ttggtatgct 
ctttgtcgta 
cctatgtgca 



gtgaggtatg 
aacatttaca 
tgagagatcg 
tacctgctgg 
ggctattcgt 
caaggaaaga 
tctatatctg 
cctgtcaagt 
tgagttgttg 
tcccatatac 
atatgttgag 



cccggaaaag 
cactgatgag 
taatgaagct 
gatctatcat 
gggtgaaccg 
atatgtcgat 
gctttctgaa 
attggatatt 
taaacttagt 
gaaaagatta 
tttt 



cttccaaact 
gagatccgtt 
tggattaggg 
cgcttcactg 
gtatggacac 
aacttcatga 
acaaggatct 
tatatgtatt 
tacactctct 
cattgggatt 



atgaagtgaa 


agtgaagagc 


60 


actgcgttgc 


aggaactggt 


120 


tattggtaaa 


gaagggaggt 


180 


tggactctga 


caactatatc 


240 


catacaatcg 


cccacacgac 


300 


tcaatgcctc 


ggcttagaga 


360 


ataaacaagg 


cctacaataa 


420 


cctgtgtaga 


atgatggctt 


480 


gatatgtctc 


tctttaccat 


540 


catattgtct 


tacgttcgtt 


600 






624 



<210> 23 

<211> 495 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 23 

ccagttttcc 

ctaagcagag 

ccaggttcca 

tcaatgagct 

ctcctccagc 

gaatgtggtg 

cagtattact 

aatagattga 

ggtctaaaaa 



gatcactcgc 
cactgctgat 
gacaatgtcg 
ggagcaaagc 
ctccaaatca 
ttcattgaca 
tttaaaattc 
tgatgtggtg 
aaaaa 



aagaaaaccc 
atgactgctt 
gactccatca 
atcaatgatc 
ggcgatgaac 
tgtccgaagg 
ctgtttaaga 
cattacatta 



taaaaatgga 
ttgtccaaaa 
tcacaaagat 
taagagccga 
ccaaaacacc 
aaaaagaaaa 
aacgagtttg 
ttctccacct 



tggtcatgat 


tctgaggata 


60 


tcttctccag 


cagatgcaaa 


120 


tgatgacatg 


ggaggcagaa 


180 


gatgggagta 


gaaggcactc 


240 


ggctagttcc 


tcttaaaaag 


300 


actatgaaat 


atgttaagag 


360 


ttgtttatta 


aagttcatca 


420 


atgaattgca 


tttctatttt 


480 






495 



<210> 24 

<211> 580 

<212> DNA 

<213> Arabidopsis thaliana 

<400> 24 



taataagctt tttacaaaaa aaaaaaaaaa aaaaaaaaaa 



gagaagcaaa 


ccgccgtgga 


60 


gacgcgttgt 


tcgttaaaat 


120 


aaagaatctg 


ttgatggaaa 


180 


gatgcttctg 


aggcttacga 


240 


gcggctgagg 


aatttgcgaa 


300 


cttagttctc 


tttagagaac 


360 


cttgttggtt 


cttgttatgt 


420 


tgtatcagga 


gaaagaggct 


480 


tgttttttca 


gatatctaaa 


540 






580 



<210> 25 
<211> 656 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 25 

cggccgcgtc gacgcttgag agattcctct ggctaaaccc agatggagtt tggatctttt 60
cttgtgtcct tagggacatc ttttgttatc ttcgtcattc tcatgcttct cttcacctgg 120 
ctttctcgca aatctggaaa tgctcccatt tattacccga atcggatcct taaagggctg 180 
gagccatggg
acttcctctg 
ttcttgagca 
ctgcctctag 
agcaaaggaa 
tcgaggctgt 
ttgtggaaag 
gtaaaacccg



aaggcacctc 
aacaagatgt 
ctgttctggg 
ccgctacaga 
cttttagcca 
gggcgttcct 
cttataagca 
agcaattcgc



cttgactcga 
cgttaactta 
gatatttgct 
caacaacata 
acttgataat 
aggagctgtt 
tgtctcttca
tattcttgtt 



aacccttttg 
tccggcgtcg 
tgttccagtc 
aagaacacaa 
ctatcaatgg 
tactggatat 
ttgagagctc 
agggatatgc 



cttggatgcg 
atactgctgt 
ttcttctcct 
agaatgcgac 
ctaacatcac 
ctttggtcac 
aagctctgat 
ctgcaccacc



tgaagctttg 
ccactttgtc 
accaactcta 
agataccaca 
aaaaaaaagt 
atatttcttc 
gtctgctgat 
tgacgg



240 
300 
360 
420 
480 
540 
600 
656 



<210> 26 

<211> 985 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 26 

gttcacactc 

gagtttctct 

ctttccacag 

acagacgctt 

cagcatctaa 

gtcgcgaagc 

caaatgagaa 

ccatctacag 

aagcaagtca 

aaggcattgc 

gatcatccat 

gaacaagagg 

gaaaccgatg 

ttcatcataa 

attcaagaac 

aacatgagct 

ctggccttag 



cggctggtga 
ctgttacagc 
caattatgga 
taggacagct 
aggatattgc 
ttcgatcagc 
gtgatgcagc 
cagccgaaga 
aggaaataac 
ttgagtctct 
gtgcacagag 
acaatcttca 
tgtctcaatg
aatgcggagc 
ctaaaggagg 
tagaggaaat 
ctagaacagc 



actgcaaaga 
agatgatgtt 
tggctggatg 
tgtgtctgag 
cggtactttg 
tctcgagtct 
tttgtttacc 
ctcgagatta 
aagacaagcc 
tgatgaactt 
agaaattgat 
agacgaaaag 
gaatgttttg 
taactccaac 
cgaaatagtg 
gaaacaagtg 
tgatg 



cagattaggt 
tcaggagtaa 
gctggagtag 
tatgcaaagc 
gcttcggaag 
gttgaccaca 
ttggaagaag 
gcctccctca 
tctgtccacg 
aacgaacgaa 
acggctcacc 
agaccttcaa 
caattcaaca 
tcagagctcg 
agagttgtgc 
tttggtcagt 



catggcttgc 
ccactggcca 
gagctccggt 
gagtctacac 
aggcagaaga 
aaagaagaaa 
gcagttcccc 
tttctctgga 
ttttgagtaa 
tgccttctct 
agttggtcga 
tagattcaat 
caggaggctc 
tgatcaaagc 
caagaccttc 
tgcccgaagc 



agaaagtttt 
attagagctt 
gcctcctcac 
ttctcagatg 
tgctggtcaa 
gattttgcaa 
tgttcaaaat 
tgccatactg 
aagcaagaaa 
gcttgatgtt 
gacaattcca 
atcttcgact 
ttcagctcca 
ggatgcccgt 
ggttttagaa 
tctaagttca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
985 



<210> 27 

<211> 527 

<212> DNA 

<213> Arabidopsis thaliana 



<220> 

<221> misc_feature
<222> 512

<223> n = a,g,c, or t
<400> 27 

acttatgaga ggttaccgat tgaggaagaa caacagcaag agcagccgct tcaactagaa 60 
gatgggaaga agcagaaaga agagaatgat gataacgaga gtgggaataa cggaaacgaa 120 
ggatcgatgc agccgccgat gtataatatg cctcctaatt ttatcccaaa tggtcatcaa 180 
atggctcaac acgacgtgta ttggggtggt cctccgcctc gtgctcctcc ttcgtattga 240
ttaagttaga taggcggtgg ttggtgcgtt ctttttactg gaatgattat attttccatt 300 
aggatgggta ggcttttgtt attaaagcta tcaagtttct ttttttttac ggataattcg 360 
gatgacaatt agctagtgtt tgtttgtttg ttttgtggcc ggcttttctg cttgactatt 420 
ttgatcgcgg atagctttgt atgaaagtga attgattgta gaatcgtctt ttgaattttg 480
atgttggaaa aaaccaagca atggtgtgtg gnctttgcaa tggaagc 527 
<210> 28

<211> 610 

<212> DNA 

<213> Arabidopsis thaliana



<220> 

<221> misc_feature 

<222> 482, 494, 495, 516, 560, 567, 587, 607

<223> n = a,g,c, or t 



<400> 28 

atcaaaagct agagtcttgg ccattcctga tgatctagca aatgtgtcat gcggtgtgga 60 

acagattgaa gaactgaaag gattgaacct tgttgagaaa gatggtggtt catcttcttc 120 

tgacggggct aggaacacta atcctgaaac tagaaggtac agtggttcct tgggtgtaga 180 

ggatggagcc tatactaatg agatgctcca gtccatagag atggttactg atgtgctgga 240 

ctctcttgtg aggagggtta cagtagcaga atctgagtct gctgtcaaaa ggagagggca 300 

cttttgggag aggaaagaaa tcagtaggaa agactatcca aatcgaaaat ttgtccgtga 360 

agttagaaga gatggaacga tttgcttatg ggactaatag tgttctaaac gaaatgcggg 420 

aaaggattga ggaattagtt gaagagcgat gaggcagagg gaaaaagctg tggaaaacga 480

anaggagttg tgtnntgtga agagagagtc gagtcnttaa aagctcctca gtactttacc 540 

atgtcgagaa cactctttcn ccggagncat tcaaaccatg aggagtnttt gacggtggca 600 

ctaaacnccg 610 



<210> 29 

<211> 546 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 29 

atgaccaata tcgccatggc tgatgctctc aaatctcttg agattgttga tggtcttgat 60 

gaatacatga atcaatctga atccagtgct ccgcattctc caaccagtgt agcaaagctg 120 

ccaccaagca ctgcaactag aacaactcga cggaagacca caacaaaagc tgagcctcag 180 

ccatcatctc agttggtgtc ccgttcttgt cgttcgacga gcaagtctct tgctggagat 240 

atggaccagg aaaacataaa caagaatgtt gctcaagaaa tgaagactag caatgtcaag 300 

tttgaagcca atgtgctcaa aactccagca gcaggaagca caaggaaaac ttcagcagca 360

acttcttgca ctaagaagga tgaattggtc cagtcggtct acagcactag gagatcaacc 420

aggctgttag agaaatgtat ggccgatctg agtttgaaga ctaaagaaac tgtggataat 480 

aaacctgcca agaatgaaga tacagaacag aaagtatctg cacaggagaa gaatctaact 540 

ggttag 546



<210> 30 

<211> 492 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 30 

atgctgatgc 

atccgagcac 

atacatcagg 

aggataggcg 

gtgaaaaacg 

acaagaaaga 

tgtcccagga 

cagctaccac 

aagtgttgtt 



tgtgtgggtt 
ccttgcatcc 
ggtcaaaact 
tgaggaaagg 
cgaaaggcat 
gaagaaggag 
tggaacattt 
ttctgcaaga 
ga 



cacggtcttg 
tctcagaaag 
gttgaagatg 
aaacctgcaa 
gagaagcgaa 
aaaaaaagaa 
tgctcttcag 
tttaaatctg 



gatatgctaa 
aagatgcaga 
gaccggatga 
agagagagaa 
gcgctcgtga 
ggcatgactc 
aggaagagtg 
ttgcttattt 



agcaccacga 
ttcagcacgc 
tgttgagagg 
ggaagaaaga 
ttcagatgat 
tgattctgat 
gtcggctagg 
catttacgaa 



ccttgggaag 
ttaccagcag 
gaccaaaaga 
catgataggc 
agaaagaagc 
tgaagcgaat 
taccaaaatc 
tcgtggagta 



60 
120 
180 
240 
300 
360 
420 
480 
492 
<210> 31

<211> 723

<212> DNA

<213> Arabidopsis thaliana

<220>

<221> misc_feature

<222> 559, 622, 677

<223> n = a, g, c, or t

<400> 31



gcaaaagaga gaaacatctg acccggaatc tgacctgaaa acccggaaga atcgaaaaat 60 

ggggaaagat ggtctgagcg acgatcaggt ctcgtcgatg aaggaagcct tcatgctctt 120 

cgacaccgat ggcgacggca aaatcgcacc gtcagagctc gggatcctca tgcgatctct 180 

cggcggaaac ccgacccaag cccagctgaa atccataatc gcatccgaga atctctcttc 240

accgtttgat ttcaacagat tcctcgatct catggcgaaa catctgaaga cggaaccttt 300 

cgatcgccag ctccgtgacg cattcaaagt gctcgataag gaaggtaccg ggttcgttgc 360 

tgtggcggat ctgaggcata ttctgaccag tatcggagag aagctggagc ctaatgagtt 420 

cgatgagtgg atcaaggagg tggatgttgg atccgatgga aagatccggt atgaagattt 480 

catagcaagg atggttgcta agtgagatct aatcttttat gttttgaaag ttgaaatttt 540 

taagaagaga ttcttttgng gttttttcac ttggttggtt tgatttcgag cgaatcctaa 600 

ctaggggttg gtttatcatt gnggaatttg cttactaact ttggcttctt catggttggg 660 

tttcaatttt taatggnaaa tggtggctgg gggaattcct aaaaaaaaaa aaaaaaaaaa 720 



<210> 32 
<211> 344 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 32 

cgcggagtct cctttcgatc aagagaaatg cgtccgattt ttgcaatctc tcagagaatg 60 
cgttctatca aagaaagtaa agaagttctc gataccgagt caagatcacg actctgaggg 120
agcagcttca gctacaaaga gaccttcata acgttctttg ttccgatttt cttttatcgt 180 
ttgagttgta atcatgtaat tgattttaat gtcatgcctt ggattcataa gctgggtcat 240 
gccttgtttc ccctttgttg tcttgtatgt tgaatattgc aaactctaaa gagcatattt 300 
ataagaagaa ataaaagttt ctacaaaaaa aaaaaaaaaa aaaa 344



<210> 33 
<211> 1131 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 33 

atgacaacta ctgggtctaa ttctaatcac aaccaccatg aaagcaataa taacaacaat 60 

aaccctagta ctaggtcttg gggcacggcg gtttcaggtc aatctgtgtc tactagcggc 120 

agtatgggct ctccgtcgag ccggagtgag caaaccatca ccgttgttac atctactagc 180 

gacactactt ttcaacgcct gaataatttg gacattcaag gtgatgatgc tggttctcaa 240

ggagcttctg gtgttaagaa gaagaagagg ggacagcgtg cggctggtcc agataagact 300 

ggaagaggac tacgtcaatt tagtatgaaa gtttgtgaaa aggtggaaag caaaggaagg 360 

acaacttaca atgaggttgc agacgagctt gttgctgaat ttgcacttcc aaataacgat 420 

ggaacatccc ctgatcagca acagtatgat gagaaaaaca taagacgaag agtatatgat 480 

gctttaaacg tcctcatggc tatggatata atatccaagg ataaaaaaga aattcaatgg 540 
agaggtcttc
cttaggaaca 
gagcacttat 
gtccagactc 
cattttgatt 
atgaagtttt 
tgtcacaatt 
ctggatatgt 
atcattccta 
tccccgtctc 



ctcggacaag 
gaattgagaa 
atagctcagg 
gtcctcacgc 
tcaacagcac 
gtgatcaacc 
tcacgccaga 
acgagactca 
tgcctgagac 
ttccagggat 



cttaagcgac 
gaaaactgca 
aaatgctccc 
aacagtagaa 
tccatttgag 
gccgcaacaa 
aaaccctaac 
tcttcaatcg 
taacaacgtt 
aatgaactcc 



attgaagaat 
tattcccaag 
agtggcggtg 
gtggagatat 
ctccacgacg 
ccaaacggtc 
aaaggcccca 
caacaacatc 
acttccagcg 
agcatgaagc 



taaagaacga 
aactggaaga 
ttgctcttcc 
cagaagatat 
acaattttgt 
ggaacaacag 
gcacaggtcc 
agcagcattc 
ctgatactgc 
cggagaattg 



acgactctca 
acaaagaaat 
ttttatcctt 
gcagctcgtg 
cctcaagact 
ccagctggtt 
aacaccgcag 
tcagctacaa 
tccagtgaaa 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1131 



<210> 34 

<211> 631 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 34 

agagtatctg 

tgaacctcct 

gggattcgcc 

agaggatgga 

aaacgaagaa 

cattgaggaa 

ctcgccggtt 

gtccgaagag 

cttataatac 

aacatcatct 

tatgtttaag 



aagaaagggt 
caagacatca 
aaagaagtga 
tcacagatgc 
gctttgaaag 
gaggtagtgt 
tggaagaatg 
gaggattgag 
ttcgtcagtt 
attaaagttg 
aaaaaaaaaa 



caccaataag 
tggacgcact 
ctaagaagaa 
atctgctcaa 
aggtggctct 
tggattggta 
ttaagccttt 
tcactttttt 
ttcagaattc 
tcttcgtttg 
aaaaaaaaaa 



cgcgctcaaa 
cttcaatgct 
gaattactta 
ttcgattggg 
ggttcttaaa 
cgaaaagggt 
tgtggagtgg 
cttccctcct 
ttaaatcttt 
gatttggttt 



agtttcatct 
ctctttgatg 
gcggctgctg 
acattctgtg 
gcattgtacg 
ctcaccggag 
cttcagagcg 
aacttttctt 
ttgctgtgtt 
tgacgacttt 



cgtctctctc 
gtgtgggaaa 
caacaatgca 
gaaagaatgg 
accaagacat 
ctgacaaaag 
ctgagtctga 
tgcggcattt 
cttataaaga 
gggaaatatt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
631 



<210> 35 

<211> 1333 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 35 
gctggaggta 
taggttgcat 
ggaggagcaa 
tagagagaag 
ggagtatgaa 
tgattcgtat 
gaacgttgaa
ggatgttgta 
tcttgggaga 
cgaggaggag 
tggcaaagaa 
tgatgcggtc 
gtttgatgat 
cgagaggtta 
gaaaaaagga 
aaggaatatc 
ttgagttttg 
caggtatgga 
gaagaaaacg 
tcagaatgtt 



gagaggaatg 
gaggagaaag 
gctgagtttg 
gagaatgctg 
gctaaagaga 
agaaataatg 
ggggttacgg 
cttcgtcttg 
ttgaagtttc 
gctaaaacgt 
acaaacggga 
gatggagaga 
tcggagaagg 
gaagctctag 
gacaaaggtg 
gatcttactc 
ggtttgagtc 
agtgacatgt 
ttctcagaat 
cgtctttctt 



cgtctgctgt 
ctgcgatgca 
atcaagaagc 
agcttgagaa 
aaatggggat 
gcgattctga 
attggaaata 
atgagtgttt 
ttgaagagaa 
ttgagagtaa 
agcacagagt 
tggaaaacgg 
gtgagaatgt 
aggcagatag 
tacatctcct 
gcgtcagaga 
cactctttgc 
tgcttgtgag 
aaaagtattt 
cgttcatttt 



tgctgccagt 
gatggaagcg 
tttgcagttg 
ggagctagag 
gttgaggagg 
tgagaatagc 
tagagagaat 
agatgattat 
actcacagat 
tggtagcatc 
tatccagtca 
gttaagtaac 
gacgatagaa 
agagttctta 
ccatgagatt 
aaacggagac 
atagtgaccc 
gcaaggaaca 
taagtatata 
cattattata 



gaaacaatgg 
ttgcagtatc 
ttgaatgagc 
gtgtatagaa 
agattgagag 
aatggagagt 
gagatggaga 
gatggagaga 
cttaataacg 
aatggaaatg 
aaaagattac 
ggaaaccatc 
gaagaagtgg 
agacattgtg 
ctgcaacatc 
atgagtttat 
aaagaacaag 
acgacaaggt 
ctctgaggaa 
agttttgttt 



cgatgatcaa 
agagaatgat 
ttatggtgaa 
agagaatgga 
attcctctgt 
tacagtttaa 
atacgccggt 
ggctttcgat 
aagaggacga 
agcatattca 
ttcccctgtt 
acgaaaacgg 
atgagcttta 
ttggttcatt 
ttcgtgatct 
gagtttgatt 
aaaaatcata 
ttcagatgaa 
aagtgtcaga 
tttatattga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
agatttattt agagagaggg aagtgtcagt ataatttcac ttttatattt tatatttggg 1260
agttgtcttt atgagtggtg gtaatagaaa aaggtagaat gatgagtgaa gaaaaaaaaa 1320 
aaaaaaaaaa aaa 1333 



<210> 36 

<211> 2289 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 36 

cttatgcaaa 

tcgccttgtt 

tcttttaaaa 

gatcactgtt 

cgatgggtta 

agctctttca 

ggctgaaata 

cacgtggata 

cattcttggg 

tcgtgaaacc 

tggcgcaatt 

tgaagcattg 

cctgaatgac 

ctcttggttc 

atcgtatttc 

cttattgtcc 

acaattcttg 

ttgattttgc 

gtcaatcgcg 

atgggcaatt 

tccaatcctt 

tgatccggct 

gaaggtacac 

gtgcggcgtt 

ctggaaacca 

acgaggacgg 

ctgtgcggct 

gagtgaacta 

aagaacaaca 

caccttcttc 

gaaaaggccc 

ttccatcccc 

tctcctgtaa 

tggaaaagct 

agagattctt 

tctttgaatt 

gaaaaagaag 

ttgtagaggt 

aaaaaaaaa 



ctctcagcag 

aaggatattg 

gagcgaatga 

cttgatagtg 

ttcaatatgt 

gagtttcttc 

ctggtgcaga 

aacgagttcg 

gctatcttgc 

aatgaagaac 

ctctctgttg 

aattggatat 

atatttgaca 

tggaggttca 

ttgtccacaa 

gaagaatgtg 

agggagaaga 

ttacttcccc 

aagggaaaga 

ataagcctct 

ggtaaaagaa 

tctggaaact 

atggttgctg 

caagatactt 

aataggcaga 

tgacttagaa 

acaacagttc 

ctcatatcac 

acagcagcag 

gacatcatca 

tggtcaatta 

gacaatcatc 

aattgtatat 

acggtcgtgt 

cttcagtccc 

tgtttcattg 

tgatcatgtg 

cacttattgg 



attctgatgc 

tgacggaaag 

acgttctcaa 

ttccagacat 

tgagcgactc 

aagagataaa 

gggctgcttc 

taaaacttgg 

cttgcatatc 

ttcgttcaat 

caaggaggca 

caacactttt 

cccttctaaa 

tgctggtgta 

tttccgagct 

tgtacttttg 

taatcttgac 

ggagttatcg 

acttttcgtt 

gcttattagc 

gacattaacg 

ccaatcttta 

aaaacacttt 

aggacaagac 

gcaacttcag 

gacgataaca 

gaaaacgtac 

tcttcctctt 

cagcaacaac 

tcagttgcag 

cagctttaac 

atcttcatct 

ctctcttttt 

ccatcatctc 

tcagtctacc 

tttggagctt 

gttttgtgtt 

atattttatg 



caatgtccaa 
tgatcagttc 
cccttacgtc 
tgacatgctt 
tagtcatgaa 
aaattcacca 
tcctgatgaa 
gggagaccag 
tgacaaagaa 
ccatgttgaa 
gctatcaagt 
aaacaagcat 
gcactatctg 
gcaaaagatc 
gataattctc 
gatgccgaaa 
tttgcttcta 
aaactgagag 
gccttgtata 
tcaggcttac 
tccaaatttc 
cttaccttag
atggtcttct 
tcaaaactgt 
gagttccttt 
tcaacagttc 
agaatctaca 
cttctacatc 
aacagcaaca 
ataacaatag 
ctacctggta 
tctttgtgtg 
tagtaactct 
tttctctctg 
tttatattgt 
tttatatttt 
gtttttttat 
tcaaatgatg 



agtgctgctc 

agtattgagg 

cggcaatttc 

gggtttctgc 

atacgacagc 

tctgtagatt 

ttcactcgat 

ctcgtgcgtt 

gagaaaatca 

ccctcagatg 

gagtttgagg 

cgtactgagg 

attcttctga 

cacaacactt 

ttttggaaag 

gagtctaccg 

ccatggttca 

aactattaaa 

cttcatggtg 

caagcatgcg 

ttaggccagc 

attgcagctt 

tatgttactt 

gccaacgtac 

ctctcagtat 

tcaccaagga 

tcgtggccag 

aaaggaggtg 

acaacaacaa 

acctccatca 

atcataaata 

gacaccaccg 

tcaagtttcg 

tcgggttttt 

ttttttgggg 

taccttatgt 

aactggaaaa 

ctccttttta 



atcttttgga 
aattcatacc 
tggttggatg 
cagactttct 
aagctgattc 
atggtcgcat 
taacagccat 
attatgctga 
gggtggttgc 
gttttgatgt 
ctactcggat 
tcttgtgctt 
tgacgtggtg 
tcgccagctc 
gcgcggtgcc 
agagctctct 
ggcattgaat 
aggttcactc 
ccattcaccc 
agtgtcgtaa 
ttgataaaat 
ctggaaccag 
cctcagcaaa 
tcattcagta 
aagcatcaaa 
atcaattttg 
gcaaggacta 
aggagatctg 
caacgaccac 
agaacttcaa 
ataaataata 
atcccttttg 
acggaacttg 
tttatttacg 
gtttctcgtt 
ggagatgtaa 
ccacatgagt 
caaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2289 



<210> 37 

<211> 1094 

<212> DNA 

<213> Arabidopsis thaliana 
<400> 37

cttgattgaa 

ggagaaagcg 

ggaacaagaa 

aactgagaat 

gcatcttcaa 

gaaacgtggg 

ccctgaggaa 

tgacatgtat 

aactccaagg 

ggtgattgaa 

tttaagagct 

ctactcattt 

acttgatgca 

tcaaaccctt 

tgaagagcca 

gtctctataa 

tgttcattat 

accaggtccc 

aaaaaaaaaa 



acttcagttg 
attgaagata 
ggagatgagg 
cgagcagaag 
atgattggag 
aagagggcat 
ccatttgaag 
acaatagcag 
aaatggtcac 
ttgggtggaa 
cccatgcctt 
ggcagcccgt 
gccatcgcca 
gacaaggtca 
gcatcaacag 
gaaacagaaa 
aagtacaagt 
tccagtctga 
aaaa 



aatccaagga 
tttcaaaaga 
atgatgatga 
gagaagattt 
tccaactctt 
ctcgtatgac 
cattcaaaga 
acgtttgggg 
aagagtggga 
ttccaacgat 
cagccttctt 
tgtacgatga 
tagttgcaga 
tatctgctag 
taagctctta 
tcggtagaag 
catgtagttc 
tacttaattc 



aacgactgaa 
agctgacaat 
aaatgaagag 
agtgaagaat 
gaaagaatcc 
acttgaggat 
aatgagggaa 
ttggacatgg 
agtcgagttg 
tggtgattgt 
gaagatcttg 
gatcatcaca 
tatggaaacc 
acaatctaat 
gttatcatat 
gagctgaatc 
cgagtgtaga 
tttagtgttg 



tcagtggtta 
gaggaggatg 
gaagaagtgg 
aaggcagctg 
gatgaagcaa 
gatgcagatg 
agaaaagtgt 
gagaaggatt 
gcaattgtgc 
gcagtgatat 
cagacgacac 
ttgtgtttgg 
acagggatca 
gagagtccgc 
cctcttctgc 
tgtcttagtt 
acagttttta 
gatctttcta 



caggtgaatc 
atgatgagga 
ttgttccaga 
acgcgaagaa 
acagaacaaa 
aggattggtt 
tcgatgtggc 
ttaagaacaa 
tcatgacaaa 
tacgagctgc 
acagtcttgg 
accttggaga 
ctgtccctga 
ggtctgagcc 
ttgttgtgaa 
atgaaagttt 
ctagtgttgc 
tataagaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1094 



<210> 38 

<211> 1204 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 38 

aaccgattca 

gcaggagaag 

tgctcttctc 

tatgcagggt 

tgaaggtact 

ctctcagacc 

ccctgggtat 

gtatcaggag 

ggttgagatt 

ttctgagtat 

gtactactca 

cctttggaac 

ctatgttgcc 

cgctaactcc 

gcctcaactc 

actaatgtca 

tgctgacgac 

gttttttggc 

ttccctatat 

attaacataa 

aaaa 



gcctccgaca 
ccatgggcct 
aagatggtgg 
aaaaccgagg 
gagactaggg 
agcaagctgg 
ggatgttggc 
ccattcttag 
ggggcattca 
cagactatcc 
ttggacatca 
aagtactggg 
gggcaaatat 
cggtatggag 
gcgaagataa 
caggttatca 
tcatcagatc 
tgcggaaatt 
ctctagtgta 
atcgatcatt 



gtatattcca 
ccgatcctaa 
ttcacgctcg 
gtgatacaat 
ttaatgctca 
ctgggaggtt 
tctcgggtat 
ctgttgttat 
gaacatatcc 
ctcttaacaa 
cttatttcaa 
tgaacactct 
cagacttggc 
gaattgcgcc 
ctcgggatag 
aagacatctt 
cagagcccat 
gactatcggt 
gtatctgctt 
atgtttttgc 



ctacgacgac 
ctacttcaag 
ctccggtggc 
catcgttatg 
gtctgatgcc 
ggagaacgtt 
tgatgtttcg 
tgatccaaca 
agagggacat 
gattgaggac 
gtcatctctc 
ttcttcttcc 
tgagaagctc 
agccggtcac 
tgcaaagata 
gttcaattcc 
gattacatcg 
ttgacccggt 
cagacaaaga 
gttaaaaaaa 



gcttcacaag 
cgcgttcaca 
acaatcgaga 
gatgcttttg 
tatgagtata 
gttggatggt 
acacagatgc 
aggactgttt 
aagatctcgg 
tttggtgtac
gatagtcacc 
ccactgttgg 
gagcaagcgg 
caaaggagga 
actgtcgagc 
gctcgtcagt 
tgaagttggt 
ttatgaggca 
tctttgggtt 
aaaaaaaaaa 



ccaaaatcca 
tctcagccct 
tcatgggtct 
ctttgcctgt 
tggttgaata 
atcactctca 
ttaaccaaca 
cggctggtaa 
atgatcatgt 
attgcaaaca 
ttctggatct 
gcaatggaga 
agagtcagct 
aagaggatga 
aggtccatgg 
ccaagaagtc 
ctattctttt 
atgcccattg 
attaaatgac 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1204 



<210> 39 

<211> 1715 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 39 
cttttaagtt
tgaaaataaa 
tagccttgga 
attaaaaatc 
tgatgtaggg 
aagagcacta 
caacaagaga 
agttactagg 
aactaagaaa 
tgaaagtgat 
aatgctggag 
agaagagcct 
atatatccat 
taactatatg 
gttaattgag 
tgtcatcgac 
tgttactgct 
tctcatcttg 
aatggccaac 
atttctcaaa 
gctttgcctt 
aatctacact 
tcacacaggc 
tcacaaggca 
ctgtcatgct 
tatgacttaa 
gagcaagttg 
aaagagtgta 
aaagaatctt 



gggggatgtt 
taacttccgc 
ttaatcggat 
aaaactttcg 
ggaggagtag 
agtactatta 
tcagtttctg 
aagtttgctg 
ccagactcag 
aaagaaggag 
gagattgaac 
gtgatcgata 
gatatgcata 
gacaatcaac 
gtgcactaca 
agattccttg 
ttgttgcttg 
atctctgaca 
accttgcaat 
gctgcccaat 
gtggagtatg 
gctcagtgta 
tacaacgaaa 
ggaacaggga 
gcaagaactg 
ctcgagtttt 
cccaaaatgg 
ttcttctctt 
ctgaaaaaaa 



tcgattttga 
agataacgaa 
caatgagtct 
ttctctttga 
taggaaagat 
acaagaacat 
aaagagatgg 
ctcagttagc 
tttcaagtga 
gtgactctaa 
agatggagaa 
ttgatgcctg 
ccttctacaa 
aagatcttaa 
agtttgaact 
cggttcatca 
catgtaaata 
aagcttactc 
tcaatttctc 
ctgacaagaa 
agatgctaga 
cacttaaggg 
aacagctact 
agctcacagg 
aaccagctgg 
tctttagaac 
gagaagaatg 
tctaaggtct 
aaaaaaaaaa 



aatttgattt 
gaagaagaaa 
ccaaggtacc 
tttttccccc 
caagacgacg 
cactgaagcg 
catttgtaat 
agatcataag 
agaaccagag 
tgagccaatg 
ggagattgaa 
tgataagaat 
gaattttgag 
tgagagaatg 
gatggaggaa 
aatcgtgagg 
tgaagaagtt 
tagaagagaa 
tctaccaact 
gcttgagatt 
gtatcttcca 
atttgaagaa 
ggcatgtgcg 
agttcacaga 
gtttctgatt 
aaaaagagtg 
gatctttaga 
ttagatcttt 
aaaaa 



cttcaagaga 
atggttagat 
ctaaatcgat 
aaattgattt 
gcaacaacag 
ccgtcttacc 
aaaccacctg 
ccacatatcc 
acgattatca 
tttgtacaac 
atggaagatg 
aatcctttgg 
aaacttagtt 
agaggaatcc 
actctttatc 
aaaaagcttc 
tcagttccag 
gtgctagata 
ccatatgttt 
ttatcattct 
tctaagctgg 
tggagcaaaa 
agaaagatgg 
aagtacaaca 
taatattaat 
tgagagaaag 
tatcatggca 
cttcacttga 



agagatttaa 
cagatgaaaa 
cgattttgtt 
tgaatttact 
gaccgacaag 
cttatgctgt 
tgcatcgacc 
gtgatgagga 
ttgatgtgga 
atactgaagc 
cagacaaaga 
ctgcggttga 
gcgtgcctcc 
tcattgactg 
tcacaatcaa 
agcttgttgg 
tggtagatga 
tggagaagct 
tcatgaaacg 
ttatgatcga 
cggcctcagc 
cctgtgagtt 
ttgctttcca 
catctaagtt 
aagaatctaa 
agagatagta 
agtagcccaa 
gagagaataa 



60 
120 
180 
240 
300 
360 
4 20 
4 80 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1715 



<210> 40 

<211> 2195 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 40 

aacccacgtc aattcttttt caaaggcata tattctctct gtttcaaact ttgtgtctct 60
tcttctcctt ctctgatcgt tcgttttctg gacgagagag atggtaaatc cgggtcacgg 120
aagaggaccc gattcgggta ctgctgctgg tgggtcaaac tccgacccgt ttcctgcgaa 180
tcttcgagtt cttgtcgttg atgatgatcc aacttgtctc atgatcttag agaggatgct 240
tatgacttgt ctctacagag taactaaatg taacagagca gagagcgcat tgtctctgct 300
tcggaagaac aagaatggtt ttgatattgt cattagtgat gttcatatgc ctgacatgga 360
tggtttcaag ctccttgaac acgttggttt agagatggat ttacctgtta tcatgatgtc 420
tgcggatgat tcgaagagcg ttgtgttgaa aggagtgact cacggtgcag ttgattacct 480
catcaaaccg gtacgtattg aggctttgaa gaatatatgg caacatgtgg tgcggaagaa 540
gcgtaacgag tggaatgttt ctgaacattc tggaggaagt attgaagata ctggcggtga 600
cagggacagg cagcagcagc atagggagga tgctgataac aactcgtctt cagttaatga 660
agggaacggg aggagctcga ggaagcggaa ggaagaggaa gtagatgatc aaggggatga 720
taaggaagac tcatcgagtt taaagaaacc acgcgtggtt tggtctgttg aattgcatca 780
gcagtttgtt gctgctgtga atcagctagg cgttgacaaa gctgttccta agaagatctt 840
agagatgatg aatgtacccg ggctaacgcg agaaaacgta gccagtcacc tccagaagta 900
tcggatatat ctgagacggc ttggaggagt atcgcaacac caaggaaata tgaaccattc 960
gtttatgact ggtcaagatc agagttttgg acctctttct tcgttgaatg gatttgatct 1020
tcaatcttta gctgttactg gtcagctccc tcctcagagc cttgcacagc ttcaagcagc 1080
tggtcttggc cggcctacac tcgctaaacc agggatgtcg gtttctcccc ttgtagatca 1140
gagaagcatc ttcaactttg aaaacccaaa aataagattt ggagacggac atggtcagac 1200
gatgaacaat ggaaatttgc ttcatggtgt cccaacgggt agtcacatgc gtctgcgtcc 1260
tggacagaat gttcagagca gcggaatgat gttgccagta gcagaccagc tacctcgagg 1320
aggaccatcg atgctaccat ccctcgggca acagccgata ttgtcaagca gcgtttcaag 1380
aagaagcgat ctcactggtg cgctggcggt tagaaacagt atccccgaga ccaacagcag 1440
agtgttacca actactcact cggtcttcaa taacttcccc gcggatctac ctcgcagcag 1500
cttcccgttg gcaagtgccc cagggatttc agttccagta tcagtttctt accaagaaga 1560
ggtcaacagc tcggatgcaa aaggaggttc atcagctgct actgctggat ttggtaaccc 1620
aagctacgac atatttaacg attttccgca gcaccaacag cacaacaaga acatcagcaa 1680
taaactaaac gattgggatc tgcggaatat gggattggtc ttcagttcca atcaggacgc 1740
agcaactgca accgcaaccg cagcattttc cacttcggaa gcatactctt cgtcttctac 1800
gcagagaaaa agacgggaaa cggacgcaac agttgtgggt gagcatgggc agaacctgca 1860
gtcaccgagc cggaatctgt atcatctgaa ccacgttttt atggacggtg gttcagtcag 1920
agtgaagtca gaaagagtgg cggagacagt gacttgtcct ccagcaaata cattgtttca 1980
cgagcagtat aatcaagaag atctgatgag cgcatttctc aaacaggaag gcatcccatc 2040
cgtagataac gagttcgaat ttgacggata ctccatcgat aatatccagg tctgactaca 2100
gaaactcaga ctagactgca agattctttg tttttcttct ccctccttcg aggtacaaag 2160
ctcaaaacat ggcaataacc gaagggaaag ataga 2195



<210> 41 

<211> 1413 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 41 

aggctgtgtt ttatcgtggg atttttaaac atggggaagg aaaatgctgt gtctcggcca 60
ttcactcgtt cccttgcctc tgctttgcgc gcttcagaag tgacttctac tacacagaat 120
caacagagag taaacacaaa aagaccagcc ttggaggata caagagccac tggacccaac 180
aagaggaaga agcgagcggt tctaggggag atcacaaatg ttaactccaa tacagctata 240
cttgaggcca aaaacagcaa gcagataaag aaaggacgcg gtcatggatt ggcgagtaca 300
tcccagttgg caacttctgt tacttcagaa gtcacagatc ttcagtccag gaccgatgca 360
aaagttgaag ttgcatcaaa tacagcagga aacctttctg tttctaaagg cacagataac 420
acagctgata actgtattga gatatggaat tctagattgc ctccaagacc tcttgggaga 480
tcagcttcta cagctgagaa aagtgctgtt attggtagtt caactgtacc ggatatccca 540
aaatttgtag acatcgattc agatgacaag gatcctttac tgtgctgcct ctatgcccct 600
gaaatccact acaatttgcg tgtttcagag cttaaacgca gaccacttcc ggactttatg 660
gagagaatac agaaggatgt cacccagtcc atgcggggaa ttctggttga ttggcttgtg 720
gaggtctctg aagaatacac acttgcatct gacactctct acctcacagt gtatctcata 780
gactggttcc tccatggaaa ctacgtgcaa agacagcaac ttcaactgct cggcatcact 840
tgcatgctaa ttgcctcgaa gtatgaggaa atctctgctc cacgcattga ggagttttgc 900
ttcattacgg ataacaccta cacaagagat caggtcctgg aaatggagaa ccaagtactt 960
aagcatttta gctttcaaat atacactccc actccaaaaa cgttccttag gagatttctc 1020
agagcagctc aagcctctcg cctgagccca agccttgaag tcgagtttct agccagctat 1080
ctaacagagt tgacattaat agactaccat ttcttaaagt ttcttccttc cgttgttgct 1140
gcttcagcgg gttttctcgc caagtggaca atggaccaat caaaccaccc atggaatcca 1200
acacttgagc attacacaac gtacaaagca tcggatctga aagcatctgt tcatgcctta 1260
caagatctgc agcttaacac caaaggttgc cccttgagcg ctatacgcat gaagtatagg 1320
caagagaaat acaaatctgt ggcggttctc acgtctccaa agctacttga cacgctattc 1380
tgaaggtttc aactcctaac cgataatagt ttt 1413



<210> 42 

<211> 2766 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 42 
atttgagagg aagctttatt ttgtgtgtag atggcgaata atcctccgca gtcttctggt 60

acccagggtc agcattttgt tcctgcagct tcacaacctt ttcaccctta tggacatgta 120 

cctccaaatg ttcaaagtca gcctccacag tattctcagc cgatacagca gcagcagctc 180

tttccagtga gaccaggtca gcctgtgcat attacatcat cctcacaggc tgtatcagtt 240

ccgtatattc aaacgaacaa gattctcact tctggatcta ctcaaccaca gccaaatgca 300 

cctccaatga cgggctttgc tacatctgga cctccatttt cttctccata tacttttgta 360

ccatcatctt atcctcagca acaaccaaca tccttggtcc aaccaaattc tcagatgcat 420

gtagctggcg tccctccagc agcaaacact tggcctgttc ctgttaatca aagcacatca 480

cttgtttccc ctgtgcagca gactgggcaa caaacaccgg tcgcagtttc cacagaccca 540 

ggaaacttga ctccgcaatc tgcatctgac tggcaggagc atacatctgc tgatgggaga 600 

aaggctgatg catccactgt atggaaggaa tttacaacac ctgaaggaaa gaaatattat 660 

tataacaagg ttacaaagga gtctaagtgg acaattccgg aagatttaaa gttagctcgg 720 

gaacaagccc aactagctag tgaaaaaacg tccctttcgg aagctggatc tacccctcta 780

tcccaccatg ctgcatcctc gtctgatcta gcagttagca ctgtgacttc tgttgttccc 840

agcacatctt cagcacttac tggacattct tcaagcccta ttcaagcggg tttggctgta 900 

cctgtcaccc gtcctccctc tgttgctcct gttactccaa catctggtgc aattagtgac 960 

actgaggcta ctacaatgta ctatttttcc ttgggaagtt ttgctgagaa taaggaaatg 1020 

tctgtgaatg gaaaagccaa tttgtcacct gctggtgaca aagcaaatgt cgaggaacct 1080 

atggtatatg ctactaagca ggaggccaaa gctgctttca agtctctttt ggaatctgta 1140

aatgttcatt ccgactggac atgggaacag acattgaaag agattgttca cgataaaaga 1200 

tatggtgctt tgaggacact cggcgagcgg aaacaagcgt ttaacgagta tcttggccaa 1260

aggaaaaaag tggaagctga ggaaagacga aggaggcaga agaaagctcg ggaagaattt 1320 

gtcaagatgc tagaggagtg tgaagaactt tcatcatccc tgaaatggag caaagcaatg 1380 

agtttgttcg aaaatgatca gcgttttaaa gctgttgacc gtcctaggga tcgtgaagat 1440

ctttttgaca attacattgt ggaacttgag aggaaggaaa gagaaaaggc agcggaggaa 1500 

catcggcagt atatggcaga ctatcggaag tttcttgaaa cctgtgacta tatcaaagct 1560

ggtacacaat ggcgcaaaat tcaagataga ctggaggatg atgacagatg ctcatgtctt 1620 

gaaaagatag atcgtctgat tggttttgag gaatacattc ttgacctaga gaaggaagaa 1680 

gaagagctga agagagtaga gaaagaacat gtaaggcggg ccgagagaaa aaaccgtgat 1740

gcatttcgta cactattgga agaacatgtt gctgcaggca tccttacagc caagacgtac 1800 

tggttggatt attgcattga gttaaaagac ttgccccaat accaagctgt tgcatctaat 1860

acatctggtt caactccgaa agacttgttt gaagatgtca cagaagaatt agagaagcag 1920 

tatcatgagg ataagagcta tgtgaaggat gctatgaagt caagaaaggc aaattttaaa 1980 

tctgctattt cagaagatct cagtactcaa cagatatcag acataaattt aaagcttata 2040

tatgatgact tggttgggag agtgaaggaa aaagaagaaa aagaggccag aaagcttcag 2100 

cgtctggctg aagaatttac caatctgttg cacactttca aggaaatcac cgtagcttca 2160 

aattgggaag atagcaaaca actagtagaa gaaagtcaag agtacagatc gattggagat 2220 

gaaagtgtta gccaagggtt atttgaggaa tacataacga gtttacaaga aaaggcaaag 2280 

gagaaggagc gtaagcgtga cgaggaaaag gttagaaaag agaaggaaag ggacgagaaa 2340

gagaaacgga aagacaagga taaggagaga agggaaaagg aaagagaacg tgaaaaagag 2400

aagggaaaag agaggagtaa acgggaagaa tcagatggtg agactgctat ggatgtgagc 2460

gaaggtcata aagacgagaa aagaaaggga aaagatcgtg acagaaaaca tcgaagacgg 2520 

catcacaaca attctgatga agatgttagt tctgataggg atgacagaga tgagtcgaag 2580 

aaatcatccc gtaaacatgg taatgatcgc aaaaaatcaa gaaagcacgc aaactcgcct 2640 

gaatcggaga gtgaaaaccg gcataaaaga cagaaaaaag agagtagtcg ccgaagtggt 2700 

aatgacgagc tagaggatgg agaagttggg gagtgatagt gaaattcgac attaatctga 2760

aacctt 2766



<210> 43 

<211> 1260 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 43 

tgaaacctag atttctgcaa ctgaattcct aattcgaaaa agaatggagg gttcgtcgtc 60
gacgatagca aggaagacat gggaactaga gaacagcatt ctaacagtag actcacctga 120
ttcaacctcc gacaacatct tctactacga cgatacttca cagactaggt tccagcaaga 180
gaaaccgtgg gagaatgatc ctcactactt taaacgagtc aagatctcag cgctcgctct 240
tcttaagatg gtggttcacg ctcgctctgg tggtacaatt gaaataatgg gtcttatgca 300
aggtaagacc gatggtgata ctatcattgt tatggatgct tttgctttac cagtggaagg 360
tactgagaca agggttaatg ctcaggatga tgcttatgag tacatggttg agtattcaca 420
gaccaacaag ctcgcggggc ggctggagaa tgttgttgga tggtatcact ctcaccctgg 480
atatggatgc tggctctccg gtattgatgt ttctacgcag aggcttaacc aacagcatca 540
ggagccattt ttagctgttg ttattgatcc cacaaggact gtttcagctg gtaaggttga 600
gattggtgct ttcagaacat actctaaagg atataagcct ccagatgaac ctgtttctga 660
gtatcaaact attcctttaa ataagattga ggactttggt gttcactgca aacagtacta 720
ttcattagat gtcacttatt tcaagtcatc tcttgattct caccttctgg atctactatg 780
gaacaagtac tgggtgaaca ctctttcttc ttctccactg ctgggtaatg gagactatgt 840
tgctggacaa atatcagact tagctgagaa gcttgagcaa gccgagagtc atctggttca 900
gtctcgcttt ggaggagttg tgccatcatc ccttcataag aaaaaagagg atgagtctca 960
actaactaag ataactcggg atagcgcaaa gataactgtg gaacaggtcc atggactaat 1020
gtcgcaggtc ataaaagatg aattattcaa ctcaatgcgt cagtccaaca acaaatctcc 1080
cactgactcg tcggatccag accctatgat tacatattga agttgctctt cttttggttt 1140
ctagttttgg attgacccat catttgttgt cctttcattt attttctgtt gtgtaaagaa 1200
ttataatgct aatcagaata atacagaaga agattttggt taaaaaaaaa aaaaaaaaaa 1260



<210> 44 

<211> 653 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 44 

cttaaactac atttatcatt acagtctgat ttgagctaag ttctctcatc ataaactctc 60
cttggagaat catggctatt tcaaaagctc ttatcgcttc tcttctcata tctcttcttg 120
ttctccaact cgtccaggct gatgtcgaaa actcacagaa gaaaaatggt tacgcaaaga 180
agatcgattg tgggagtgcg tgtgtagcac ggtgcaggct ttcgaggagg ccgaggctgt 240
gtcacagagc gtgcgggact tgctgctaca ggtgcaactg tgtgcctccg ggtacgtacg 300
gaaactacga caagtgccag tgctacgcta gcctcaccac ccacggtgga cgccgcaagt 360
gcccataaga agaaacaaag ctcttaattg ctgcggataa tgggacgatg tcgttttgtt 420
agtatttact ttggcgtata tatgtggatc gaataataaa cgagaacgta cgttgtcgtt 480
gtgagtgtga gtactgtatt attaatggtt ctatttgttt ttacttgcaa gttttcttgt 540
tttgaatttg tttttttcat atttgtatat cgattcgtgc attattgtat tatttcaatt 600
tgtaataaga ttatgttacc tttgagtggt tgtttaaaaa aaaaaaaaaa aaa 653



<210> 45
<211> 1266
<212> DNA



<213> Arabidopsis thaliana 



<220>
<221> misc_feature
<222> (757)..(761)
<223> n = a, g, c, or t



<400> 45 

agatggggaa gaagaacaag agaagtcaag acgagtctga gctcgaattg gagccagagc 60
taacgaaaat aatcgatgga gactctaaaa agaagaaaaa taagaataag aagaagagaa 120
gccatgaaga tacggagata gaaccggagc aaaagatgag tctcgacgga gactcgaggg 180
aggagaagat aaagaagaag aggaagaaca agaaccaaga ggaggagcca gagcttgtga 240
cggagaaaac gaaagtccaa gaggaggaaa agggaaatgt agaagagggt agagccactg 300
ttagcatagc catagctggt tcaatcatcc acaacactca atcacttgag ctcgccacac 360
gcgtaatctc tctttctctc tatctctccc ttcgtttctc tgtttttcca ttcccagata 420
atttaaagtc cccttcttcc atttctaaca tttctcagct cgccggccaa attgctcgtg 480
cagctacaat tttccgaatc gacgagatcg tagtgttcga caataagagc agctcagaaa 540
tcgaatcagc tgctacgaat gcttctgata gcaatgaaag tggtgcctcc tttctcgttc 600
gtatcttgaa gtatctagag acaccacaat atttgaggaa atctctcttc cccaagcaaa 660
atgatcttag atatgtgggt atgttgccgg gtatgttgcc acctcttgat gctcctcacc 720
atctgcgtaa gcacgagtgg gaacaatacc gtgaagnnnn nattgttcca ccctctaagc 780
caagggaaga agcaggaatg tattggggat acaaagtacg atatgcatca caattaagtt 840
cagtattcaa ggaatgccct ttcgagggtg gttacgatta tttgattggt acctcggagc 900
acggcctggt aattagttca tctgagctga aaataccaac atttaggcac ctattgattg 960
catttggtgg acttgctggg cttgaagaaa gtattgaaga tgataatcag tataagggga 1020
aaaacgttcg agatgtgttt aatgtatact tgaatacttg tccacatcaa ggtagccgaa 1080
ccattcgagc agaggaagcg atgtttatat cacttcagta cttccaggaa cccatcagca 1140
gggcagtgag aagactttaa gcttcgataa aaagagtcaa agaagctatt ttgttctcat 1200
agatctgagg tttgtctgaa aaagagtgat gtaatgtaac tgttttagaa aaaaaaaaaa 1260
aaaaaa 1266



<210> 46 

<211> 1520 

<212> DNA 

<213> Arabidopsis thaliana 



<220>
<221> misc_feature
<222> (1455)
<223> n = a, g, c, or t



<400> 46 

atggaattgc ttgacatgaa ctcaatggct gcctcaatcg gcgtctccgt cgccgttctc 60
cgtttcctcc tctgtttcgt cgcaacgata ccaatctcat ttttatggcg attcatcccg 120
agtcgactcg gtaaacacat atactcagct gcttctggag ctttcctctc ttatctctcc 180
tttggcttct cctcaaatct tcacttcctt gtcccaatga cgattggtta cgcttcaatg 240
gcgatttatc gacccttgtc tggattcatt actttcttcc taggcttcgc ttatctcatt 300
ggctgtcatg tgttttatat gagtggtgat gcttggaaag aaggaggaat tgattctact 360
ggagctttga tggtattaac actgaaagtg atttcgtgtt cgataaacta caacgatgga 420
atgttgaaag aagaaggtct acgtgaggct cagaagaaga accgtttgat tcagatgcct 480
tctcttattg agtactttgg ttattgcctc tgttgtggaa gccatttcgc tggcccggtt 540
ttcgaaatga aagattatct cgaatggact gaagagaaag gaatttgggc tgtttctgaa 600
aaaggaaaga gaccatcgcc ttatggagca atgattcgag ctgtgtttca agctgcgatt 660
tgtatggctc tctatctcta tttagtacct cagtttccgt taactcggtt cactgaacca 720
gtgtaccaag aatggggatt cttgaagaga tttggttacc aatacatggc gggtttcacg 780
gctcgttgga agtattactt tatatggtct atctcagagg cttctattat tatctctggt 840
ttgggtttca gtggttggac tgatgaaact cagacaaagg ctaaatggga ccgcgctaag 900
aatgtcgata ttttgggggt tgagcttgcc aagagtgcgg ttcagattcc gcttttctgg 960
aacatacaag tcagcacatg gctccgtcac tacgtatatg agagaattgt gaagcccggg 1020
aagaaagcgg gtttcttcca attgctagct acgcaaaccg tcagtgctgt ctggcatgga 1080
ctgtatcctg gatacattat attctttgtg caatcagcat tgatgatcga tggttcgaaa 1140
gctatttacc ggtggcaaca agcaatacct ccgaaaatgg caatgctgag aaatgttttg 1200
gttctcatca atttcctcta cacagtagtg gttctcaatt actcatccgt cggtttcatg 1260
gttttaagct tgcacgaaac actagtcgcc ttcaagagtg tatattacat tggaacagtt 1320
atacctatcg ctgtgcttct tctcagctac ttagttcctg tgaagcctgt tagaccaaag 1380
accagaaaag aagaataatg ttgtcttttt aaaaaatcaa caacattttg gttcttttct 1440
ttttttccac ttggnccgtt ttatgtaaaa caagagaaat caagatttga ggttttattc 1500
ttaaaaaaaa aaaaaaaaaa 1520
<210> 47

<211> 1142 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 47 

ttatataacc tatctacact ttgatctccg acaattcact ttcccaataa gaaccaactg 60
agagagagag agcgccggag aagaagaatt ttagagagcg atggacgagg gagttatagc 120
agtttccgcc atggatgctt tcgagaagct tgagaaagtt ggtgaaggga catacgggaa 180
agtttacaga gccagagaga aagctaccgg gaaaatcgtc gctctaaaga agacgcgtct 240
ccatgaggac gaagaaggcg ttccttccac cactctccgc gagatctcca ttttgcgaat 300
gctcgctcgt gatcctcacg tcgtcaggtt aatggatgtt aagcaaggac taagcaaaga 360
aggcaaaact gtactgtacc tggtttttga atacatggac actgatgtca agaaattcat 420
cagaagtttc cgtagcactg gcaagaacat tccaacccaa actatcaaga gcttgatgta 480
tcaactatgc aaaggtatgg cattctgcca tggtcacggg atattgcaca gagatctcaa 540
gcctcacaat ctcttgatgg atcccaagac aatgaggctc aaaatagcag atcttggttt 600
agccagagcc ttcactctgc caatgaagaa gtatacccat gagatattaa ctctttggta 660
tagagctcca gaggttcttc ttggtgccac ccattactct acagctgtgg atatgtggtc 720
tgttggctgc atatttgctg aacttgtgac caaccaagca atctttcagg gagactctga 780
gctccaacag ctcctccata ttttcaagtt gtttgggaca cccaatgaag aaatgtggcc 840
aggagtgagc acactcaaga actggcatga atacccacag tggaaaccat cgactctatc 900
ctctgctgtt ccaaacctcg acgaggctgg agttgatctt ctatctaaaa tgctgcagta 960
cgagccagcg aaacgaatct cagcaaagat ggctatggag catccttact ttgatgatct 1020
gccagaaaag tcctctctct aaggatttaa aatcttcagt tagtatcttt ccaagtttta 1080
tggtttttct agttttgctt ctttcaagca tatctctagt gtgctgcttc cccctctatg 1140
aa 1142



<210> 48 

<211> 1189 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 48 

tagtcaacga tggatttgag acatgaacaa ctaattgatt tgatttcgtg tagctaactt 60
tgttaattgg taaattgtgt agagaaggat gagtatggag atggagttgt ttgtcactcc 120
agagaagcag aggcaacatc cttcagtgag cgttgagaaa actccagtga gaaggaaatt 180
gattgttgat gatgattctg aaattggatc agagaagaaa gggcaatcaa gaacttctgg 240
aggcgggctt cgtcaattca gtgttatggt ttgtcagaag ttggaagcca agaagataac 300
tacttacaag gaggttgcag acgaaattat ttcagatttt gccacaatta agcaaaacgc 360
agagaagcct ttgaatgaaa atgagtacaa tgagaagaac ataaggcgga gagtctacga 420
tgcgctcaat gtgttcatgg cgttggatat tattgcaagg gataaaaagg aaatccggtg 480
gaaaggactt cctattacct gcaaaaagga tgtggaagaa gtcaagatgg atcgtaataa 540
agttatgagc agtgtgcaaa agaaggctgc ttttcttaaa gagttgagag aaaaggtctc 600
aagtcttgag agtcttatgt cgagaaatca agagatggtt gtgaagactc aaggcccagc 660
agaaggattt accttaccat tcattctact tgagacaaac cctcacgcag tagtcgaaat 720
cgagatttct gaagatatgc aacttgtaca cctcgacttc aatagcacac ctttctcggt 780
ccatgatgat gcttacattt tgaaactgat gcaagaacag aagcaagaac agaacagagt 840
atcttcttct tcatctacac atcaccaatc tcaacatagc tccgctcatt cttcatccag 900
ttcttgcatt gcttctggaa cctcaggccc ggtttgctgg aactcgggat ccattgatac 960
tcgctgaccg agcttctatt cccaaattct tcaagaagaa gaagtaatga tctaattggt 1020
atactaaaaa attatacatc tggtttagtg ttcaattgag agagactgta aaatcaattc 1080
ataggccaac aaatgtttgt ttatccaatt ttccttttta ttcgaacttg atgcgatatt 1140
tcaacggaaa cagaaactat tgttttaaac caaaaaaaaa aaaaaaaaa 1189



<210> 49 
<211> 805
<212> DNA 

<213> Arabidopsis thaliana 



<400> 49

atgaataggg aaaagttgat gaagatggct aacactgtcc gcactggcgg aaaggggaca 60
gtaagaagaa agaagaaggc tgttcacaag accactacaa ccgatgacaa gaggctccag 120
agcactctta agagagttgg agtcaattcc attgccgcca ttgaagaagt taacattttt 180
aaggatgatg tagtcattca gttcattaac cctaaagttc aagcttcaat tgctgctaac 240
acatgggttg tgagtggtac accacagacg aaaaaattgc aagacattct tcctcagatt 300
atcagccaac ttggaccaga taacttggac aacctgaaga agctagcaga gcaattccag 360
aaacaagctc caggtgcagg tgatgtccca gcaacaatcc aagaagagga cgatgatgat 420
gatgtcccag atcttgtagt gggagagact ttcgagaccc ctgctactga agaggctccc 480
aaagctgctg cttcttagag gaggaggaag aagaaggaga agagctcacc tgcaaaaccc 540
atcataaaaa tgtttgtcgc tcgacctcfct ctgagcactg tcagattctt gttttctcta 600
atgcttgcga acagaaagac ttggttttat tatcacttga tgctttttgg tccgaacagc 660
aattttcctt ttattaaggt tagatcgctt tttgtttacc acctgttcaa atgagtacta 720
ctatgtcctg tcgcttcata cacttcttgc aacacagtcc tttgttttga gtcaaaaaaa 780
aaaaaaaaaa aaaaaaaaaa aaaaa 805



<210> 50 
<211> 1539 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 50 

aagctttact acttatactc ttttgttcct atggccaccg tatcttcttc ctcctggcca 60 

aaccccaacc ctaatcccga ttccacgtct gcctcagatt ccgattctac ttttccctct 120 

caccgcgatc gcgtagacga acccgactct ctcgattcct tctcctccat gagtcttaac 180 

tccgacgaac ctaatcagac ttctaatcaa tcgcctcttt ctccccctac gcccaattta 240 

ccggtgatgc ctcctccgtc cgtgcttcat ctttccttta accaagatca tgcttgcttc 300 

gctgtcggca ctgaccgtgg cttccggatc cttaattgcg atccctttcg cgagattttc 360 

cggcgtgatt tcgatcgtgg cggtggtgtt gcagtcgtgg agatgctttt cagatgcaat 420 

atattagccc tagttggtgg cggacctgat cctcaatatc ctcctaataa ggttatgatt 480 

tgggatgatc accagggccg atgtatcgga gaactctctt tcaggtccga tgttagatcc 540

gtccggctta ggagggatcg gattattgtc gttcttgagc agaagatttt tgtctacaat 600 

ttctctgacc tcaagctgat gcatcagatt gaaaccattg ccaaccctaa gggtttgtgt 660 

gctgtttctc agggtgttgg ttctatggtt ttggtatgtc caggtttgca gaaaggtcaa 720 

gttcggatcg agcactacgc ttctaaacgg accaaattcg tcatggctca tgattccaga 780 

atagcttgct tcgctctcac gcaggatggc catttgttgg ccactgctag ctctaagggt 840

actctggttc ggatcttcaa tactgttgat ggtaccttgc gtcaagagtc tggcacttct 900 

gaggatgaaa taggtaagga gggtgcggat agagcagaga tctacagttt ggccttctct 960 

tcaaatgctc agtggttagc tgtctcaagt gacaaaggaa cggtccatgt ctttggtctc 1020 

aaagtcaact ccggatctca agtgaaagac tcatcccgaa ttgcacctga tgctactccc 1080 

tcatccccat cgtcgtctct gtctttattc aaagtgttac cgaggtattt cagctcggag 1140 

tggtcggtgg ctcagttcag gttggttgaa ggaactcagt acatagccgc ctttggccat 1200 

caaaagaaca ccgttgttat tcttggcatg gatgggagct tctacagatg ccagtttgat 1260 

ccggtgaacg gcggtgaaat gtctcagctt gagtaccaca actgtctgaa accgccttca 1320 

gttttctaaa agctttacta cttatactct tttgttcctt ctctctcttt atatctctct 1380 

gcaacttaag cggtgagata tggtgtatag ttttgtgtat ataataatga tgggtcgtcc 1440 

tataatttgt aaaacctttt atcgctaccc gggtcgactc tagagcccta tagtgagtcg 1500 

tattactgca gagatctatg aatcgtagat actgaaaaa 1539 



<210> 51 
<211> 1977 



<212> DNA 

<213> Arabidopsis thaliana 



<400> 51 

agagcttcct ctctctatat ctggctttct atggatgtag gagttactac ggcgaagtct 60
atacttgaga agcctctgaa gcttctcact gaagaagaca tttctcagct tactcgcgaa 120
gattgccgca aattcctcaa agagaaaggt ttcttcttct tcctttctcc atttttttcc 180
ggtcttattg tcttcgacga atggcggctg acacgtgtcg aaacaggaat gcgcaggcct 240
tcgtggaata aatctcaggc gatccagcaa gttttatctc ttaaagctct ctatgaacct 300
ggagatgatt ccggcgccgg aatcctccgc aagatccttg tttctcagcc gccaaatccg 360
cctcgcgtta caacaacgtt gattgagcca aggaacgagc tcgaagcttg tggaaggatt 420
cctttacagg aagatgatgg tgcgtgccat agaagggatt ctccaagatc agctgagttt 480
tctggtagtt ctggtcagtt tgttgcggat aaagatagcc acaagactgt ttctgtttcc 540
cccagaagcc cagctgaaac aaatgcggtg gttgggcaaa tgacgatatt ttatagtggc 600
aaagtgaatg tatatgatgg agtaccacct gaaaaggccc ggtctatcat gcattttgca 660
gccaatccaa ttgatttgcc tgaaaatggt atttttgctt ctagtagaat gatttcgaaa 720
cccatgagta aagagaagat ggtggagctt ccccaatatg gacttgaaaa ggcacctgct 780
tctcgtgatt ctgatgttga gggtcaggcg aacagaaaag tatcgttgca aagatatctt 840
gaaaagcgga aagacagatt ttctaagacc aagaaggctc caggagttgc gtcctctagc 900
ttggagatgt ttctgaatcg tcagccacgg atgaacgctg catattcaca aaaccttagt 960
ggcacagggc attgcgagtc acctgaaaat caaacaaaaa gtcccaatat ctcagttgat 1020
ctaaacagtg atctaaacag cgaaggtgcc aaaagaactg gagatggtac tacgggtcaa 1080
aaggcgggaa gaacaatttc atgttcttat aacatgacta agacatcacg aggaacacga 1140
tgggtgaagc ggtcaagaga agaagtgatt caagcttggt atatggatga tagtgaagag 1200
gatcagagac ttcctcacca caaggatcct aaagagtttg tatcgttgga caaacttgca 1260
gagctgggag tacttagctg gagacttgat gctgataact atgaaaccga tgaggatttg 1320
aaaaagatcc gtgaatctcg tggttactct tacatggact tttgtgaggt atgcccggaa 1380
aagcttccaa actatgaagt gaaagtgaag agctttttcg aagaacattt acacactgat 1440
gaggagatcc gttactgcgt tgcaggaact ggttactttg atgtgagaga tcgtaatgaa 1500
gcttggatta gggtattggt aaagaaggga ggtatgatag tcttacctgc tgggatctat 1560
catcgcttca ctgtggactc tgacaactat atcaaggcaa tgcggctatt cgtgggtgaa 1620
ccggtatgga caccatacaa tcgcccacac gaccatcttc ctgcaaggaa agaatatgtc 1680
gataacttca tgatcaatgc ctcggcttag agagcttcct ctctctatat ctggctttct 1740
gaaacaagga tctataaaca aggcctacaa taaagaaagc tttcctgtca agtattggat 1800
atttatatgt attcctgtgt agaatgatgg cttttggtat gcttgagttg ttgtaaactt 1860
agttacactc tctgatatgt ctctctttac catctttgtc gtatcccata tacgaaaaga 1920
ttacattggg attcatattg tcttacgttc gttcctatgt gcaatatgtt gagtttt 1977



<210> 52 

<211> 525 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 52 

catcgctttt cgctgaaatc aaaatttctc cagttttccg atcagtcgca agaaaaccct 60
aaaaatggat ggtcatgatt ctaaggatac taagcagagc actgctgata tgactgcttt 120
tgtccaaaat cttctccagc agatgcaaac caggttccag acaatgtcgg actccatcat 180
cacaaagatt gatgacatgg gaggcagaat caatgagctg gagcaaagca tcaatgatct 240
aagagccgag atgggagtag aaggcactcc tcctccagcc tccaaatcag gcgatgaacc 300
caaaacaccg gctagttcct cttaaaaagg aatgtggtgt tcattgacat gtccgaagga 360
aaaagaaaaa ctatgaaata tgttaagagc agtattactt ttaaaattcc tgttttaaga 420
aacgagtttg ttgtttatta aagttcatca aatagattga tgatgtggtg cattacatta 480
ttctccacct atgaattgca tttctatttt ggtctaaaaa aaaaa 525
<210> 53
<211> 2610
<212> DNA 

<213> Arabidopsis thaliana 
<400> 53 

agaacaattg agattcttgg ttgtgttaag atggaaatct acaccatgaa aacgaatttt 60 

cttgtactgg ctttgtcttt gtgtatcctt ctttcaagct tccatgaggt ttcttgtcag 120 

gatgatggta gtggtttgag taatttggat ctaatagaac gtgattatca agatagtgtc 180 

aatgctcttc aaggcaagga cgatgaagat cagtctgcaa agatacagag tgaaaaccag 240 

aataacacta cagtgactga taagaacact atttctctat ctctatcaga tgaatctgag 300 

gttggatctg ttagtgatga aagcgttgga cgttcgagtc tgttggatca aatcaaactt 360 

gaattcgaag ctcatcacaa tagtattaac caagctggat ctgatggtgt caaggctgaa 420 

tccaaggatg atgatgaaga attatctgct catagacaga aaatgttgga agaaatcgaa 480 

catgagtttg aagctgcttc agatagtctg aaacaactaa agactgatga tgtaaacgaa 540 

ggaaatgatg aagaacattc tgcaaagagg caaagtttgt tggaagagat cgaacgtgag 600 

tttgaagctg ctacaaaaga acttgaacaa ctaaaggtta atgacttcac cggggacaaa 660 

gatgacgaag aacactctgc aaagagaaaa agtatgcttg aagctattga acgcgagttt 720 

gaagctgcta tggaaggcat tgaagcactt aaggtttctg attccacagg aagcggagat 780 

gatgaagaac aatctgcaaa gagactaagt atgcttgaag agatcgaacg ggaatttgaa 840 

gctgcttcaa aaggtcttga acaactaagg gctagcgatt caaccgcgga caataacgaa 900 

gaagaacacg ctgcaaaggg acaaagtttg ttagaagaga tcgaacgaga gttcgaagct 960 

gctacagaga gccttaagca acttcaagtt gatgattcta ctgaagacaa agaacactgt 1020 

aaagcactct tcttcttatt atctgctatt ctttctctat ggttatctga atcaggcttt 1080 

gaatgtattg tagttacagc tgcaaagagg caaagtctgc tggaagagat tgaacgtgaa 1140 

tttgaagctg caacaaaaga tcttaaacaa ctaaatgatt tcactgaagg cagtgctgat 1200 

gatgaacaat ctgcaaagag aaacaaaatg ttggaagata tcgaacgcga atttgaagct 1260 

gctacaatag gtcttgaaca actaaaggct aatgatttct ctgaaggcaa taataatgaa 1320 

gaacaatctg caaagagaaa gagtatgctt gaagagatcg aacgcgagtt cgaagctgct 1380 

attggaggtc ttaaacagat caaagttgat gattccagaa atcttgaaga agaatctgct 1440

aagagaaaga taattttgga agagatggaa cgtgaatttg aagaagcaca cagtggtatt 1500 

aatgcaaagg ctgacaaaga agaatctgca aagaaacaga gtggctctgc tataccagag 1560 

gttcttggac taggacagtc aggtggttgt agctgttcta aacaagacga agattcctcg 1620 

attgttatac caacaaaata tagcatagaa gatatcctct ctgaagaatc tgcagtccag 1680 

ggaacagaga cttctagtct caccgcgtct ttgactcaac tcgttgagaa tcacaggaaa 1740 

gaaaaggaat ctctactcgg acacagagtt ctcacttctc cttctatagc ttcttccaca 1800

agcgaatcat ctgctacatc agagactgta gaaaccctaa gggctaaact gaatgagctt 1860 

cgcggcttaa ccgctcgtga gcttgtgaca cgtaaagatt tcggtcagat tctcattacg 1920 

gctgcgagtt ttgaagagct aagttcagct ccaatcagtt acatttctag gttagctaaa 1980 

tacagaaacg tcatcaaaga aggacttgaa gcttctgaga gagttcacat cgcgcaggta 2040 

cgagcaaaaa tgctcaaaga agttgccacg gagaagcaaa ccgccgtgga cactcatttc 2100 

gcaaccgcta aaaagcttgc tcaagaagga gacgcgttgt tcgttaaaat cttcgcaatc 2160 

aagaaactgt tggcgaaact tgaagcagag aaagaatctg ttgatggaaa gtttaaggag 2220 

actgtgaaag aactttctca tcttctggct gatgcttctg aggcttacga agagtatcat 2280 

ggcgcggtga ggaaggcgaa agacgagcaa gcggctgagg aatttgcgaa agaggcgacg 2340

caaagtgcag agatcatttg ggttaagttt cttagttctc tttagagaac aattgagatt 2400 

cttggttgtg ttaagagcaa atctagagct cttgttggtt cttgttatgt attttgtgat 2460

gatgttctgt ttcagagttt gtgtgttggt tgtatcagga gaaagaggct gggagataga 2520 

gagaaagaga gtctctgcga aaactaataa tgttttttca gatatctaaa taataagctt 2580

tttacaaaaa aaaaaaaaaa aaaaaaaaaa 2610 



<210> 54 

<211> 2235 

<212> DNA 

<213> Arabidopsis thaliana

<400> 54 
aatttgaatc caatccccaa attatctcat atggagtttg gatcttttct tgtgtcctta 60
gggacatctt ttgttatctt cgtcattctc atgcttctct tcacctggct ttctcgcaaa 120
tctggaaatg ctcccattta ttacccgaat cggatcctta aagggctgga gccatgggaa 180
ggcacctcct tgactcgaaa cccttttgct tggatgcgtg aagctttgac ttcctctgaa 240
caagatgtcg ttaacttatc cggcgtcgat actgctgtcc actttgtctt cttgagcact 300
gttctgggga tatttgcttg ttccagtctt cttctcctac caactctact gcctctagcc 360
gctacagaca acaacataaa gaacacaaag aatgcgacag ataccacaag caaaggaact 420
tttagccaac ttgataatct atcaatggct aacatcacaa aaaaaagttc gaggctgtgg 480
gcgttcctag gagctgttta ctggatatct ttggtcacat atttcttctt gtggaaagct 540
tataagcatg tctcttcatt gagagctcaa gctctgatgt ctgctgatgt aaaacccgag 600
caattcgcta ttcttgttag ggatatgcct gcaccacctg acgggcagac acagaaagag 660
tttattgatt cttatttcag agaaatatac cctgagacat tctacagatc gcttgtcgca 720
acagaaaaca gcaaggttaa taaaatatgg gaaaaattgg aaggttacaa gaagaagctt 780
gcgcgagcag aagcaatatt agcagcaact aataaccgtc ccacgaacaa aaccggcttc 840
tgtgggctag tcggtaaaca agtagacagc attgagtatt acactgagct aatcaacgag 900
tctgtagcca aactggaaac agagcagaaa gcggttcttg ctgagaagca gcaaaccgca 960
gcagtggttt tcttcacaac cagggttgct gctgcatcag cagctcagtc tctgcactgc 1020
cagatggttg ataaatggac tgtgaccgaa gctcctgagc cacggcagct cctatggcag 1080
aatctcaaca tcaagctctt cagcagaata atccggcaat acttcatcta cttctttgtt 1140
gcagtgacca ttctgtttta catgatacca atcgcgttcg tctctgccat caccactctt 1200
aagaatcttc agaggattat tccgttcata aagccggttg tggagataac cgccataaga 1260
accgttttgg agtctttcct tcctcagatt gcgctcattg ttttcttggc catgttgccg 1320
aagcttctct tgtttctctc caaagccgag gggattcctt cacagagcca tgccattagg 1380
gctgcttcag ggaagtactt ttacttctcg gtctttaatg tcttcattgg tgttaccctt 1440
gctgggactt tgttcaacac agtgaaggat atcgcgaaaa atcccaaact cgacatgatt 1500
attaaccttt tggctactag cctccctaag agcgcaactt tcttcctgac ctacgttgct 1560
ctcaagttct ttatcggtta tggccttgag ctgtctcgga tcataccttt gataatcttc 1620
cacctgaaaa agaagtatct ctgcaaaacc gaagcggagg tcaaagaagc ttggtacccg 1680
ggagacttaa gctatgcgac tagggttccc ggagacatgc tcatcctcac aatcacgttc 1740
tgctattcag tcattgctcc tcttatcctc atattcggca tcacctactt tggtttaggc 1800
tggctagtcc tcaggaatca ggcgttgaaa gtgtacgttc catcatacga gagctatgga 1860
agaatgtggc cgcatattca ccagcgcata ctagcagcgt tgtttctatt ccaagtggta 1920
atgtttggct acttaggagc caagacattc ttctacacgg cccttgtgat ccctctcatt 1980
atcacctctc tcatcttcgg atatgtgtgc cgccagaaat tctacggagg gttcgaacac 2040
acagctctcg aggtagcttg ccgtgagctg aagcagagtc cagacctaga ggagattttc 2100
agagcataca ttccgcatag cttgagctct cacaaaccag aagaacacga gttcaaaggc 2160
gcaatgtctc gttatcaaga tttcaacgca atagcaggcg tttaaagctt gagagattcc 2220
tctggctaaa cccag 2235



<210> 55 
<211> 4002 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 55 

aacaataaga agaaaaagtt tcattttctg atggcggagc agaagagtac caatatgtgg 60
aactgggagg tgactgggtt cgaatcgaag aagtcgcctt ctagtgagga aggcgttcat 120
cggacaccgt cttctatgct tcgacggtac tcgatcccga agaactcgct tccaccgcac 180
tcgtcggagc ttgcgtctaa ggttcagagt ttgaaggata aagttcagct tgcaaaggac 240
gattatgtgg gattgagaca ggaagctact gatcttcaag agtactccaa tgcgaagctt 300
gaaagggtta cacgttattt aggtgttctg gctgataaaa gtcgtaaact ggatcaatat 360
gcacttgaga ctgaggctag gatatctcca cttatcaatg agaagaagag actgttcaat 420
gacttactga cgaccaaagg tgcagatctt ccatttccga cgtcattctc tatccttact 480
tctattgata ttgatcacac cagaccctta tttgaagacg agggtccctc tatcattgaa 540
tttcctgata actgcactat acgcgtaaac actagtgatg atactctgtc caatcccaag 600
aaggaatttg aatttgatag agtttatggg cctcaagttg gacaagcttc actgttcagt 660
gatgtccaac cttttgtgca atccgctctg gatggatcga acgtttctat atttgcgtat 720

ggccaaactc acgcggggaa gacatacacc atggttgccc ctcctttccc tttcctctct 780 

gaaattagat ataggtcttg tttggattta aatatgatag gcaagttcat ggacgttcat 840 

agtaagttca tggacgaagg atctaatcag gaccgtggtt tatatgctcg ttgttttgag 900 

gaacttatgg acttggccaa ttctgattca acttccgcat ctcagttcag tttctctgtt 960 

tcagtgtttg agctttataa cgaacaggtc agggatttac tctcgggttg tcagagcaat 1020 

ttgccaaaga tcaatatggg tttacgcgaa tcggttatag aactttcaca ggaaaaagtt 1080 

gataatccat cagagttcat gagagtcctg aactctgcat ttcagaatag agggaatgat 1140

aaatcaaagt ctactgtgac ccatctgatt gtctcgatac acatttgtta tagcaacaca 1200

attacgagag aaaatgtaat tagcaagctt tctttagttg acctggctgg aagtgaaggt 1260 

ttaactgtgg aggatgacaa tggagatcat gtaactgatc tgctccatgt aacaaattca 1320 

atttccgcgc tgggagatgt tttatcatct ttgacgtcaa aaagagatac cattccttac 1380

gagaactcat ttcttacaag aatacttgca gattcactag gagggagctc caaaacattg 1440 

atgatcgtca acatttgtcc aagtgcacgg aacttgtctg aaataatgtc gtgtctcaac 1500 

tatgctgcta gagctcgaaa tactgtacca agccttggga atcgagacac aattaagaaa 1560

tggagagacg tggcaaatga tgctcggaag gaggtattgg agaaagagag ggaaaatcag 1620 

cgtctaaaac aagaggttac gggtttaaaa caagcactta aagaagcaaa tgaccaatgt 1680 

gtactgctct ataatgaagt acagagagcg tggagagttt cattcacact gcaatcagat 1740 

ttaaagtcag agaatgcgat ggttgtagac aaacataaaa tagaaaagga gcagaatttt 1800 

cagttaagaa atcaaatagc tcaactttta cagttagaac aggaacaaaa gctgcaggcg 1860 

caacaacaag attccaccat tcaaaatctc cagtctaaag tgaaagactt agaatcacaa 1920 

ctaagtaaag ctctgaagtc tgacatgaca agatcgagag atcccttgga acctcagccc 1980 

agagcagctg agaacacact cgattcttct gcagttacca agaaacttga ggaagaattg 2040

aaaaaacgtg atgcactgat tgagaggttg catgaagaaa atgaaaaatt gttcgacaga 2100 

ttaacagaaa agtcagtggc tagctcgact caggtatcta gcccctcatc aaaagcttca 2160 

ccaacagtgc agcctgcaga tgttgacagg aaaaatagcg cgggcacttt accgtcttca 2220 

gtggataaaa atgagggcac gattacatta gtaaaatcca gctctgaatt agtaaaaacc 2280

actccagctg gagaatactt aacagctgca ttgaatgatt ttgatcccga acaatatgaa 2340

ggtcttgcag ccatagctga tggcgcaaac aagcttctga tgctgg±ctt agcagctgtc 2400

ataaaggctg gtgcttccag agagcatgaa atccttgctg agatcagaga ttctgtcttt 2460

tcatttatcc ggaaaatgga accaaggaga gtaatggata caatgcttgt ttctcgagtc 2520

aggatattgt acataaggtc cttacttgca cgatcacctg agcttcagtc gatcaaggtt 2580

tctcctgttg aacgcttttt ggagaagcca tatactggtc gaactagaag ctccagcggg 2640 

agtagcagcc caggtagatc accagttcga tattatgatg agcagattta tggctttaaa 2700 

gttaatttaa agccagaaaa gaaaagtaag ttggtatctg tagtttcaag aatccgtgga 2760 

catgaccagg atactgggag gcagcaagtg actggaggaa agctgaggga gatacaagat 2820 

gaagccaaaa gttttgccat tggaaacaaa cccttagctg ctttatttgt tcacactccg 2880

gctggtgaac tgcaaagaca gattaggtca tggcttgcag aaagttttga gtttctctct 2940

gttacagcag atgatgtttc aggagtaacc actggccaat tagagcttct ttccacagca 3000 

attatggatg gctggatggc tggagtagga gctgcggtgc cacctcacac agacgcttta 3060 

ggacagcttt tgtctgagta tgcaaaacga gtctacactt ctcagatgca gcatctaaag 3120 

gatattgccg gtactttggc ttcggaagag gcagaagatg ctggtcaagt cgcgaagctt 3180 

cgatcagctc tcgagtctgt tgaccacaaa agaagaaaga ttttgcaaca aatgagaagt 3240

gatgcagctt tgtttacctt ggaagaaggc agttcccctg ttcaaaatcc atctacagca 3300 

gccgaagact cgagattagc ctccctcatt tctctggatg ccatactgaa gcaagtcaag 3360 

gaaataacaa gacaagcctc tgtccacgtt ttgagtaaaa gcaagaaaaa ggcattgctt 3420 

gagtctcttg atgaacttaa cgaacgaatg ccttctctgc ttgatgttga tcatccatgt 3480

gcacagagag aaattgatac ggctcaccag ttggtcgaga caattccaga acaagaggac 3540

aatcttcaag acgaaaagag accttcaata gattcaatat cttcgactga aaccgatgtg 3600 

tctcaatgga atgttttgca attcaacaca ggaggctctt cagctccatt catcataaaa 3660 

tgcggagcta actccaactc agagctcgtg atcaaagcgg atgcccgtat tcaagaacct 3720 

aaaggaggcg aaatagtgag agttgtgcca agaccttcgg ttttagaaaa catgagctta 3780 

gaggaaatga aacaagtgtt tggtcagttg cccgaagctc taagttcact ggccttagct 3840

agaacagctg atggcacacg ggctcgatac tctagactct acagaactct agccatgaag 3900 

gttccctctc ttagggacct cgttggagag cttgagaaag gaggagtctt aaaagataca 3960 

aaatcgacat gataggatta gggttttttc gtgaatttga aa 4002
<210> 56

<211> 1251 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 56 

ttagttagat aggcggtggt tggtgcgttc atggcgaatc cttggtgggt agggaatgtt 60
gcgatcggtg gagttgagag tccagtgacg tcatcagctc cttctttgca ccacagaaac 120
agtaacaaca acaacccacc gactatgact cgttcggatc caagattgga ccatgacttc 180
accaccaaca acagtggaag ccctaatacc cagactcaga gccaagaaga acagaacagc 240
agagacgagc aaccagctgt tgaacccgga tccggatccg ggtctacggg tcgtcgtcct 300
agaggtagac ctcctggttc caagaacaaa ccaaagagtc cagttgttgt taccaaagaa 360
agccctaact ctctccagag ccatgttctt gagattgcta cgggagctga cgtggcggaa 420
agcttaaacg cctttgctcg tagacgcggc cggggcgttt cggtgctgag cggtagtggt 480
ttggttacta atgttactct gcgtcagcct gctgcatccg gtggagttgt tagtttacgt 540
ggtcagtttg agatcttgtc tatgtgtggg gcttttcttc ctacgtctgg ctctcctgct 600
gcagccgctg gtttaaccat ttacttagct ggagctcaag gtcaagttgt gggaggtgga 660
gttgctggcc cgcttattgc ctctggaccc gttattgtga tagctgctac gttttgcaat 720
gccacttatg agaggttacc gattgaggaa gaacaacagc aagagcagcc gcttcaacta 780
gaagatggga agaagcagaa agaagagaat gatgataacg agagtgggaa taacggaaac 840
gaaggatcga tgcagccgcc gatgtataat atgcctccta attttatccc aaatggtcat 900
caaatggctc aacacgacgt gtattggggt ggtcctccgc ctcgtgctcc tccttcgtat 960
tgattagtta gataggcggt ggttggtgcg ttctttttac tggaatgatt atattttcca 1020
ttaggatggt taggcttttg tttattaaag ctatcaagtt tctttttttt ttacggataa 1080
ttcggatgac aattagctag tgtttgtttg tttgttttgt ggcggctttt ctgacttgac 1140
tattttgatc gcggatagct ttgtatgaaa gtgaattgat tgtagaatcg tcttttgaat 1200
tttgatgttg gaaaaaacca agcaatggtg tgtggccttt gcaatggaag c 1251



<210> 57 

<211> 2955 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 57

aatttgcttt atctttgcat tgttgttggc atggctctca atctccgtca gaaacagact 60
gaatgtgtaa tccggatgtt gaatctgaac caacctttga atccaagtgg aactgcgaac 120
gaagaagttt acaagatctt gatttacgat aggttttgtc agaacattct atctccattg 180
acccatgtca aggatctgcg taagcatgga gttacactct tctttctcat agacaaagat 240
cgacaacctg ttcatgatgt tcccgctgtc tactttgttc aaccaactga atccaacctc 300
cagaggatca tagccgatgc ttctagatct ctctacgata cctttcatct gaatttctcg 360
tcttcgatcc ctcgtaagtt tcttgaagag ctagcttctg ggactcttaa atctggttct 420
gttgagaaag tctcgaaagt gcatgatcag tatctggagt ttgtgacttt ggaagataac 480
ttgttctcgc tggctcagca atctacctat gttcaaatga atgacccatc agcaggggag 540
aaagagatta atgagattat cgaaagggtc gctagtggtt tgttttgtgt gttggtaacg 600
cttggtgtgg ttcctgttat ccgatgccct agtggtggac ctgcagagat ggtggcgtct 660
ttgttggatc agaaactgag ggatcatctt ttgtccaaga acaatctgtt tactgaaggt 720
ggcggtttca tgagctcgtt tcagcgtccc ctcttgtgca tatttgatag gaactttgag 780
ctctcggttg ggattcagca tgatttcaga taccggcctc tcgttcacga tgttctcggg 840
ttaaagctca accaattgaa agtgcaggga gagaaaggac caccgaaatc gtttgagctg 900
gacagttcgg acccattctg gtcagcaaac agtactctgg agtttccaga tgtcgctgtg 960
gagatcgaaa cacagttgaa caagtacaag agagacgttg aagaggttaa caagaaaacc 1020
ggaggtggga gcggcgctga gtttgatggg acagatctga ttggaaacat ccacaccgag 1080
catctcatga acactgtgaa atcgctcccg gagttaactg agcgaaagaa agtgattgac 1140
aaacacacca atatcgcaac agcgctctta ggacagatca aggagagatc tattgacgct 1200
ttcactaaga aagaaagcga catgatgatg aggggcggaa tcgacagaac tgaacttatg 1260
gctgctctga aaggcaaagg gacaaagatg gacaagctcc ggtttgcaat catgtacctg 1320

atctccacag aaaccataaa ccaatcggaa gttgaagcag tggaggcagc attgaatgaa 1380 

gctgaggctg atacaagtgc gtttcagtat gtaaagaaaa tcaaatcgtt aaacgcatct 1440

tttgcagcta catcagcgaa ttcagctagc agaagcaaca ttgtagactg ggccgagaag 1500 

ctttacggac agtctataag cgcagtgact gcaggagtca agaatctgtt atctagtgat 1560 

caacaattgg cagtgactcg aacagtcgaa gctttaacag aaggaaaacc aaacccggag 1620 

atcgattctt accgcttcct ggacccaaga gctccaaagt cgtctagctc cggtggtagc 1680 

catgtaaaag gaccgttcag agaagctata gtgttcatga tcggtggagg taactatgtt 1740

gagtatggaa gtttgcagga gttgactcag agacagttaa ccgttaaaaa cgttatttat 1800

ggagccactg agattcttaa cggaggtgag ttggtggagc agcttggact tttgggaaag 1860

aagatgggat taggaggtcc ggtcgcttca acgctgaaga ggctgggaat ggctggtaaa 1920 

gaggagactg atgtatctgc acaagggtct ttaaccaggg aggccactga gatatggagg 1980 

agtgagttgg aatctcgccg gtttcaggta gatagtttag aagctgaact tgtggatgtc 2040 

aaggcttacc ttgagtttgg ctcagaagaa gatgccagaa aggagttagg agttctttcg 2100 

ggtagggtca gatcgactgc aactatgttg cgttatttga gatcaaaagc tagagtcttg 2160

gccattcctg atgatctagc aaatgtgtca tgcggtgtgg aacagattga agaactgaaa 2220 

ggattgaacc ttgttgagaa agatggtggt tcatcttctt ctgacggggc taggaacact 2280 

aatcctgaaa ctagaaggta cagtggttcc ttgggtgtag aggatggagc ctatactaat 2340 

gagatgctcc agtccataga gatggttact gatgtgctgg actctcttgt gaggagggtt 2400

acagtagcag aatctgagtc tgctgttcaa aaggagaggg cacttttggg agaggaagaa 2460

atcagtagga agactatcca aatcgaaaat ttgtccgtga agttagaaga gatggaacga 2520 

tttgcttatg ggactaatag tgttctaaac gaaatgcggg aaaggattga ggaattagtt 2580 

gaagagacga tgaggcagag ggaaaaagct gtggaaaacg aagaggagtt gtgtcgtgtg 2640

aagagagagt tcgagtcgct taaaagctac gtcagtactt ttaccaatgt tcgagaaaca 2700

cttctttcgt ccgagagaca attcaaaacc attgaggagc tctttgaacg gttggtcact 2760

aagacgacac aattagaagg ggagaaggca caaaaggagg ttgaagtaca gaaactgatg 2820 

gaggagaatg tgaaattgac agcacttctc gacaagaaag aggctcagct tctagctttg 2880

aatgaacaat gcaaagttat ggctttgagt gcatcaaaca tatgactctc taatccaacc 2940

gaatctcaag cttcc 2955 

<210> 58 
<211> 865 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 58 

ggctgataaa tatagggaga actatttggg tcacagtatc aaagcccctg ttggaagatg 60 

gcaaaaaggt aaagatcttc attggtatgc tagagataaa aagcaaaagg gttccgagat 120 

ggatgctatg aaagaagaga ttcaaagagt taaggaacaa gaggagcagg ccatgaggga 180 

ggctcttggc ttggcaccaa agtcctctac aaggccacaa ggaaatcgcc ttgataagca 240 

agagtttact gaacttgtga agaggggttc gacagcagag gacttaggtg cagggaatgc 300 

tgatgctgtg tgggttcacg gtcttggata tgctaaagca ccacgacctt gggaagatcc 360

gagcaccctt gcatcctctc agaaagaaga tgcagattca gcacgcttac cagcagatac 420 

atcaggggtc aaaactgttg aagatggacc ggatgatgtt gagagggacc aaagaaggat 480

aggcgtgagg aaaggaaacc tgcaaagaga gagaaggaag aaagacatga taggcgtgaa 540

aaacgcgaaa ggcatgagaa gcgaagcgct cgtgattcag atgatagaaa gaagcacaag 600 

aaagagaaga aggagaaaaa aagaaggcat gactctgatt ctgattgaag cgaattgtcc 660 

caggatggaa cattttgctc ttcagaggaa gagtggtcgg ctaggtacca aaatccagct 720 

accacttctg caagatttaa atctgttgct tatttcattt acgaatcgtg gagtaaagtg 780

ttgttgaaca ttgttgaaaa tgtttgttaa aacacatgaa aaatgtggtt tgatattata 840 

acaaaccgag acgctcgttt tagct n 865



<210> 59 
<211> 723 
<212> DNA 
<213> Arabidopsis thaliana
<220>

<221> misc_feature

<222> 559,622,677 

<223> n = a,g,c, or t 



<400> 59 

gcaaaagaga gaaacatctg acccggaatc tgacctgaaa acccggaaga atcgaaaaat 60
ggggaaagat ggtctgagcg acgatcaggt ctcgtcgatg aaggaagcct tcatgctctt 120
cgacaccgat ggcgacggca aaatcgcacc gtcagagctc gggatcctca tgcgatctct 180
cggcggaaac ccgacccaag cccagctgaa atccataatc gcatccgaga atctctcttc 240
accgtttgat ttcaacagat tcctcgatct catggcgaaa catctgaaga cggaaccttt 300
cgatcgccag ctccgtgacg cattcaaagt gctcgataag gaaggtaccg ggttcgttgc 360
tgtggcggat ctgaggcata ttctgaccag tatcggagag aagctggagc ctaatgagtt 420
cgatgagtgg atcaaggagg tggatgttgg atccgatgga aagatccggt atgaagattt 480
catagcaagg atggttgcta agtgagatct aatcttttat gttttgaaag ttgaaatttt 540
taagaagaga ttcttttgng gttttttcac ttggttggtt tgatttcgag cgaatcctaa 600
ctaggggttg gtttatcatt gnggaatttg cttactaact ttggcttctt catggttggg 660
tttcaatttt taatggnaaa tggtggctgg gggaattcct aaaaaaaaaa aaaaaaaaaa 720
aaa 723
<210> 60

<211> 426

<212> DNA

<213> Arabidopsis thaliana

<400> 60

caaaaaaaga gatcgcttca atggagaaac agagtactca accaatttgc ggccaagagg 60
ctctccaact tctcaattgc gtcgcggagt ctcctttcga tcaagagaaa tgcgtccgat 120
ttttgcaatc tctcagagaa tgcgttctat caaagaaagt aaagaagttc tcgataccga 180
gtcaagatca cgactctgag ggagcagctt cagctacaaa gagaccttca taacgttctt 240
tgttccgatt ttcttttatc gtttgagttg taatcatgta attgatttta atgtcatgcc 300
ttggattcat aagctgggtc atgccttgtt tcccctttgt tgtcttgtat gttgaatatt 360
gcaaactcta aagagcatat ttataagaag aaataaaagt ttctacaaaa aaaaaaaaaa 420
aaaaaa 426

<210> 61

<211> 1442 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 61 

tcaaaatcag aaactttcct tgacaaattt taacaaatct ctttctcgtt ttctattgaa 60
ttctccagta gcgcggtagt tagttttagg tggaagaaga atgacaacta ctgggtctaa 120
ttctaatcac aaccaccatg aaagcaataa taacaacaat aaccctagta ctaggtcttg 180
gggcacggcg gtttcaggtc aatctgtgtc tactagcggc agtatgggct ctccgtcgag 240
ccggagtgag caaaccatca ccgttgttac atctactagc gacactactt ttcaacgcct 300
gaataatttg gacattcaag gtgatgatgc tggttctcaa ggagcttctg gtgttaagaa 360
gaagaagagg ggacagcgtg cggctggtcc agataagact ggaagaggac tacgtcaatt 420
tagtatgaaa ggtcttatct ctttctctgc ccctattatg ctttcatcta aatgcctttc 480
aatttgtgaa aaggtggaaa gcaaaggaag gacaacttac aatgaggttg cagacgagct 540
tgttgctgaa tttgcacttc caaataacga tggaacatcc cctgatcagc aacagtatga 600
tgagaaaaac ataagacgaa gagtatatga tgctttaaac gtcctcatgg ctatggatat 660
aatatccaag gataaaaaag aaattcaatg gagaggtctt cctcggacaa gcttaagcga 720
cattgaagaa ttaaagaacg aacgactctc acttaggaac agaattgaga agaaaactgc 780

atattcccaa gaactggaag aacaagtaat gaacatcatc gatactctcg gcttatctgc 840

ttcctgcctt cagaatctga tacagagaaa tgagcactta tatagctcag gaaatgctcc 900 

cagtggcggt gttgctcttc cttttatcct tgtccagact cgtcctcacg caacagtaga 960 

agtggagata tcagaagata tgcagctcgt gcattttgat ttcaacagca ctccatttga 1020

gctccacgac gacaa±tttg tcctcaagac tatgaagttt tgtgatcaac cgccgcaaca 1080 

accaaacggt cggaacaaca gccagctggt ttgtcacaat ttcacgccag aaaaccctaa 1140 

caaaggcccc agcacaggtc caacaccgca gctggatatg tacgagactc atcttcaatc 1200 

gcaacaacat cagcagcatt ctcagctaca aatcattcct atgcctgaga ctaacaacgt 1260 

tacttccagc gctgatactg ctccagtgaa atccccgtct cttccaggga taatgaactc 1320

cagcatgaag ccggagaatt gaaacacgta tgaaggcccc ttgtacaatt tctgtaaaac 1380 

tgtaaagtag ctcttgaaaa actttacctg gttttttgac gaatagtctg tttagcggta 1440 

aa 1442



<210> 62 
<211> 1506 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 62 

atggcgctgc agaacattgg tgcttccaac cgtaacgatg ccttctacag gtacaagatg 60

cctaagatgg ttaccaaaac cgaaggcaaa ggtaatggca ttaagaccaa cattatcaac 120 

aatgttgaga ttgccaaagc cttggctaga ccgccttctt atacgaccaa gtactttggt 180 

tgtgagcttg gagcgcagtc taagtttgat gagaagactg ggacgtcgct tgtgaatgga 240 

gctcacaaca cgtctaagct tgctgggctt ttggagaatt ttattaagaa gtttgttcag 300

tgttatggat gtggtaaccc ggagactgag attattatta cgaagacgca gatggtgaat 360 

ctcaagtgtg ctgcttgtgg gtttatctct gaggtcgaca tgagggataa gttgactaat 420 

ttcattctca agaacccacc tgagcagaag aaggtgtcaa aggataagaa agcaatgagg 480 

aaagctgaga aggagaggct taaagaaggc gagctagctg atgaggagca gagaaagctg 540

aaagctaaga agaaagcatt gtctaacggc aaggattcta agacgtctaa gaaccattct 600 

tctgatgagg atataagccc gaagcatgat gagaatgctc tagaggtgga tgaggatgaa 660 

gatgatgatg atggtgtcga gtggcaaact gatacttccc gagaagctgc tgagaaaaga 720 

atgatggaac agttgagtgc taaaactgcc gaaatggtga tgctctctgc aatggaagta 780 

gaagagaaaa aggcgcccaa aagcaaatct aacgggaacg ttgtgaaaac tgagaatcct 840

cctccgcaag agaagaatct cgtgcaggat atgaaagagt atctgaagaa agggtcacca 900

ataagcgcgc tcaaaagttt catctcgtct ctctctgaac ctcctcaaga catcatggac 960 

gcactcttca atgctctctt tgatggtgtg ggaaagggat tcgccaaaga agtgactaag 1020 

aagaagaatt acttagcggc tgctgcaaca atgcaagagg atggatcaca gatgcatctg 1080 

ctcaattcga ttgggacatt ctgtggaaag aatggaaacg aagaagcttt gaaagaggtg 1140 

gctctggttc ttaaagcatt gtacgaccaa gacatcattg aggaagaggt agtgttggat 1200 

tggtacgaaa agggtctcac cggagctgac aaaagctcgc cggtttggaa gaatgttaag 1260

ccttttgtgg agtggcttca gagcgctgag tctgagtccg aagaggagga ttgagtcact 1320 

tttttcttcc ctcctaactt ttctttgcgg catttcttat aatacttcgt cagttttcag 1380 

aattcttaaa tctttttgct gtgttcttat aaagaaacat catctattaa agttgtcttc 1440 

gtttggattt ggttttgacg actttgggaa atatttatgt ttaagaaaaa aaaaaaaaaa 1500 

aaaaaa 1506 



<210> 63 
<211> 2631 
<212> DNA 

<213> Arabidopsis thaliana
<400> 63 

atggcggcta acaaattcgc gactctgatt catcggaaaa caaaccgaat cactttaatc 60 
ctcgtatacg cttttctcga atggtcactc attttcttca ttttgctcaa ctctctcttt 120 
tcttatttca tactcagatt cgctgattat ttcggtctta aacgtccttg tctcttctgc 180
tctagactcg atcgtttctt cgatgcttct ggtaaatctc cttctcatcg agatcttctc 240
tgcgatgatc atgctctcca attacattca aaacctgttg aagaatctaa ttgtggtttc 300
ggagaatttc acaatgattt ggttcatcgt ggttgttgcg tagagaagat aagttcgtca 360
ctatgtgctc cgattgagtc tgactttggg aatttagatt atccaattgg agatgaaggt 420
cagatttaca atggtcttaa gtttcctcga tcgatcttcg tctttgaaga agagaaagta 480
ggatctgtaa atttgaatga ttctcaggaa gaaacagagg agaagaaagt tccccaatct 540
catgagaaac ttgaagatga tgatgttgat gaggagtttt catgctatgt atcaagcttc 600
gattgtaaga acaaagaaat tgcaacagag aaggaagaag aaaacagagt ggatctacct 660
atagaggtgg aaactgcaga atcagctccg aaaaacctcg agttctatat tgatgaagaa 720
gactgtcatt tgattccagt tgaattctat aaaccgagtg aagaagttcg agagatttcc 780
gacattaacg gagattttat cctcgatttc ggcgttgagc atgatttcac ggcggcggcg 840
gagacggagg aaatctccga ctttgcttcg ccgggtgaat cgaaaccgga ggatgcagag 900
acgaatctag ttgcttcgga aatggaaaac gacgacgaag aaacagacgc agaggtttct 960
ataggtacag agattcctga tcatgagcaa atcggagata ttccttctca ccagctcatt 1020
cctcaccacg atgacgatga tcatgaggag gaaacgttgg agttcaaaac agtaacgatt 1080
gaaaccaaga tgccagtctt aaacatcaac gaagagcgga ttttagaagc tcaaggctcg 1140
atggaaagct cgcatagtag tctacataac gctatgtttc acttagagca aagagtatct 1200
gttgatggta ttgaatgtcc tgaaggagta ctcactgttg ataagttgaa gtttgagtta 1260
caagaagaga gaaaagcact tcacgcgtta tacgaggagc tggaggtaga gaggaatgcg 1320
tctgctgttg ctgccagtga aacaatggcg atgatcaata ggttgcatga ggagaaagct 1380
gcgatgcaga tggaagcgtt gcagtatcag agaatgatgg aggagcaagc tgagtttgat 1440
caagaagctt tgcagttgtt gaatgagctt atggtgaata gagagaagga gaatgctgag 1500
cttgagaagg agctagaggt gtatagaaag agaatggagg agtatgaagc taaagagaaa 1560
atggggatgt tgaggaggag attgagagat tcctctgttg attcgtatag aaataatggc 1620
gattctgatg agaatagcaa tggagagtta cagtttaaga acgttgaagg ggttacggat 1680
tggaaatata gagagaatga gatggagaat acgccggtgg atgttgtact tcgtcttgat 1740
gagtgtttag atgattatga tggagagagg ctttcgattc ttgggagatt gaagtttctt 1800
gaagagaaac tcacagatct taataacgaa gaggacgacg aggaggaggc taaaacgttt 1860
gagagtaatg gtagcatcaa tggaaatgag catattcatg gcaaagaaac aaacgggaag 1920
cacagagtta tcaagtcaaa gagattactt cccctgtttg atgcggtcga tggagagatg 1980
gaaaacgggt taagtaacgg aaaccatcac gaaaacgggt ttgatgattc ggagaagggt 2040
gagaatgtga cgatagaaga agaagtggat gagctttacg agaggttaga agctctagag 2100
gcagatagag agttcttaag acattgtgtt ggttcattga aaaaaggaga caaaggtgta 2160
catctcctcc atgagattct gcaacatctt cgtgatctaa ggaatatcga tcttactcgc 2220
gtcagagaaa acggagacat gagtttatga gtttgatttt gagttttggg tttgagtcca 2280
ctctttgcat agtgacccaa agaacaagaa aaatcataca ggtatggaag tgacatgttg 2340
cttgtgaggc aaggaacaac gacaaggttt cagatgaaga agaaaacgtt ctcagaataa 2400
aagtatttta agtatatact ctgaggaaaa gtgtcagatc agaatgttcg tctttcttcg 2460
ttcattttca ttattataag ttttgttttt tatattgaag atttatttag agagagggaa 2520
gtgtcagtat aatttcactt ttatatttta tatttgggag ttgtctttat gagtggtggt 2580
aatagaaaaa ggtagaatga tgagtgaaga aaaaaaaaaa aaaaaaaaaa a 2631



<210> 64 
<211> 2743 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 64 

atgtcagacg ctctttctgc gattccggcc gcagttcatc gcaatctctc cgataaactc 60 

tatgagaagc gcaaaaatgc tgcgcttgag cttgagaata ttgtgaagaa tctaacttct 120 

tcgggtgatc atgacaagat ctcgaaagtc attgagatgt tgattaagga atttgccaaa 180 

tctcctcaag ctaatcatcg gaagggtggt ctaattggct tagctgctgt aactgttggt 240 

ttgtctacag aagctgctca atatcttgag caaatagtgc cacctgtgat taattccttt 300 

tctgatcaag atagccgagt tcggtactat gcatgtgaag ctctctataa cattgcaaag 360 

gttgtgcgag gcgatttcat tattttcttc aataagatat ttgatgcctt atgcaaactc 420 
tcagcagatt ctgatgccaa tgtccaaagt gctgctcatc ttttggatcg ccttgttaag 480
gatattgtga cggaaagtga tcagttcagt attgaggaat tcatacctct tttaaaagag 540
cgaatgaacg ttctcaaccc ttacgtccgg caatttctgg ttggatggat cactgttctt 600
gatagtgttc cagacattga catgcttggg tttctgccag actttctcga tgggttattc 660
aatatgttga gcgactctag tcatgaaata cgacagcaag ctgattcagc tctttcagag 720
tttcttcaag agataaaaaa ttcaccatct gtagattatg gtcgcatggc tgaaatactg 780
gtgcagaggg ctgcttctcc tgatgaattc actcgattaa cagccatcac gtggataaac 840
gagttcgtaa aacttggggg agaccagctc gtgcgttatt atgctgacat tcttggggct 900
atcttgcctt gcatatctga caaagaagag aaaatcaggg tggttgctcg tgaaaccaat 960
gaagaacttc gttcaatcca tgttgaaccc tcagatggtt ttgatgttgg cgcaattctc 1020
tctgttgcaa ggaggcagct atcaagtgag tttgaggcta ctcggattga agcattgaat 1080
tggatatcaa cacttttaaa caagcatcgt actgaggtct tgtgcttcct gaatgacata 1140
tttgacaccc ttctaaaagc actatctgat tcttctgatg acgtggtgct cttggttctg 1200
gaggttcatg ctggtgtagc aaaagatcca caacactttc gccagctcat cgtatttctt 1260
gtccacaatt tccgagctga taattctctt ttggaaaggt atctggaaag aacatattat 1320
ttagttggtc aaaacatatc tcgttatagg cgcggtgccc ttattgtccg aagaatgtgt 1380
gtacttttgg atgccgaaag agtctaccga gagctctcta caattcttga gggagaagat 1440
aatcttgact ttgcttctac catggttcag gcattgaatt tgattttgct tacttccccg 1500
gagttatcga aactgagaga actattaaaa ggttcactcg tcaatcgcga agggaaagaa 1560
cttttcgttg ccttgtatac ttcatggtgc cattcaccca tggcaattat aagcctctgc 1620
ttattagctc aggcttacca gcatgcgagt gtcgtgattc aatcattggt agaagaagac 1680
attaacgtca aatttctagt acagcttgat aaattgatcc ggcttctgga aactccaatc 1740
tttacttacc ttagattgca gcttctggaa ccaggaaggt acacatggtt gctgaaaaca 1800
ctttatggtc ttcttatgtt acttcctcag caaagtgcgg cgttcaagat acttaggaca 1860
agactcaaaa ctgtgccaac gtactcattc agtactggaa accaaatagg cagagcaact 1920
tcaggagttc ctttctctca gtataagcat caaaacgagg acggtgactt agaagacgat 1980
aacatcaaca gttctcacca aggaatcaat tttgctgtgc ggctacaaca gttcgaaaac 2040
gtacagaatc tacatcgtgg ccaggcaagg actagagtga actactcata tcactcttcc 2100
tcttcttcta catcaaagga ggtgaggaga tctgaagaac aacaacagca gcagcagcaa 2160
caacaacagc aacaacaaca acaacaacga ccaccacctt cttcgacatc atcatcagtt 2220
gcagataaca atagacctcc atcaagaact tcaagaaaag gccctggtca attacagctt 2280
taacctacct ggtaatcata aataataaat aatattccat ccccgacaat catcatcttc 2340
atcttctttg tgtggacacc accgatccct tttgtctcct gtaaaattgt atatctctct 2400
tttttagtaa ctcttcaagt ttcgacggaa cttgtggaaa agctacggtc gtgtccatca 2460
tctctttctc tctgtcgggt tttttttatt tacgagagat tcttcttcag tccctcagtc 2520
tacctttata ttgttttttt gggggtttct cgtttctttg aatttgtttc attgtttgga 2580
gctttttata tttttacctt atgtggagat gtaagaaaaa gaagtgatca tgtggttttg 2640
tgttgttttt ttataactgg aaaaccacat gagtttgtag aggtcactta ttggatattt 2700
tatgtcaaat gatgctcctt tttacaaaaa aaaaaaaaaa aaa 2743



<210> 65 

<211> 2959 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 65 

atgtcactct tgttcctcaa tcctccgttt ccctccaatt caatccaccc aattcctcgt 60
cgtgccgccg gaatatcctc cattcgatgc tcaatttctg caccggagaa gaaaccgagg 120
aggaggagga agcagaagcg cggcgacgga gctgagaatg acgactcttt gtctttcgga 180
agtggtgaag ctgtctccgc tctggagagg agtctccgcc tcacttttat ggacgagctt 240
atggaacgag ctagaaatcg agatacttca ggtgtttctg aggttatcta tgacatgatt 300
gctgctgggc ttagccctgg acctcgttct ttccatggtt tggttgtagc tcacgcgctt 360
aacggcgacg aacaaggcgc gatgcactcg ctgagaaagg agctaggtgc aggccaacgt 420
ccgcttcctg agactatgat tgctttggtt cgtctctctg gttcgaaagg gaatgctacg 480
agaggcctag aaatcctcgc cgctatggaa aagcttaagt atgacattcg tcaagcttgg 540
ctcattcttg ttgaggagct catgaggatc aatcacttgg aagatgcgaa taaagttttc 600
ttgaagggtg caagaggtgg gatgagagca acagatcagc tttatgattt gatgattgaa 660
gaagattgca aagctggaga tcattctaat gccttagaca tctcttacga aatggaggca 720
gctggtagaa tggccacaac atttcatttc aactgtcttc ttagtgtgca ggctacatgt 780
gggattcccg aggtagctta tgctacattc gaaaatatgg agtacggtga aggtttattt 840
atgaagcctg acactgagac atataactgg gtgattcaag cctacactag agccgagtca 900
tatgataggg ttcaggatgt tgctgaatta cttggaatga tggttgagga ccacaaacgt 960
gtgcagccaa atgtgaagac ttatgcgctc ttagttgagt gcttcaccaa atattgtgtc 1020
gtgaaggaag cgattagaca ttttcgtgct cttaaaaact ttgaaggagg aacagtaatt 1080
ttacacaatg cagggaattt tgaggatcct ctctctttgt atctcagggc tttgtgtcga 1140
gaaggaagaa ttgttgagct tattgatgct ttagatgcaa tgcgcaaaga taaccaacct 1200
atacctccaa gagccatgat tatgagcaga aagtatcgaa cactagtcag ctcatggatt 1260
gaaccattgc aagaagaagc tgaacttggc tatgagattg attatttagc gaggtacata 1320
gaggaagggg gacttactgg tgaacgcaag cgttgggtac ctcgaagagg gaaaactcct 1380
ttagatcccg atgcttctgg ttttatatac tcaaacccta ttgaaacatc ctttaaacag 1440
agatgccttg aagattggaa agttcaccat aggaagctct tgagaacctt acagagtgaa 1500
ggtcttccag ttctaggaga tgcatcagaa tctgattaca tgagagtggt ggagagatta 1560
cggaacataa taaaaggtcc tgcactgaat cttttgaagc cgaaagcagc aagcaagatg 1620
gttgtatcag agttaaagga agaactcgaa gctcagggtt tgccaattga tggaacaaga 1680
aatgtgcttt accagcgtgt ccaaaaagca aggagaataa acaaatctcg aggtcgacct 1740
ctttgggttc ctccaattga agaagaagag gaggaggtcg atgaagaagt agacgattta 1800
atatgtcgaa tcaagctaca tgaaggagac acagagttct ggaaacgtcg gtttcttgga 1860
gaaggcttga ttgaaacttc agttgaatcc aaggaaacga ctgaatcagt ggttacaggt 1920
gaatcggaga aagcgattga agatatttca aaagaagctg acaatgagga ggatgatgat 1980
gaggaggaac aagaaggaga tgaggatgat gatgaaaatg aagaggaaga agtggttgtt 2040
ccagaaactg agaatcgagc agaaggagaa gatttagtga agaataaggc agctgacgcg 2100
aagaagcatc ttcaaatgat tggagtccaa ctcttgaaag aatccgatga agcaaacaga 2160
acaaagaaac gtgggaagag ggcatctcgt atgacacttg aggatgatgc agatgaggat 2220
tggttccctg aggaaccatt tgaagcattc aaagaaatga gggaaagaaa agtgttcgat 2280
gtggctgaca tgtatacaat agcagacgtt tggggttgga catgggagaa ggattttaag 2340
aacaaaactc caaggaaatg gtcacaagag tgggaagtcg agttggcaat tgtgctcatg 2400
acaaaggtga ttgaattggg tggaattcca acgattggtg attgtgcagt gatattacga 2460
gctgctttaa gagctcccat gccttcagcc ttcttgaaga tcttgcagac gacacacagt 2520
cttggctact catttggcag cccgttgtac gatgagatca tcacattgtg tttggacctt 2580
ggagaacttg atgcagccat cgccatagtt gcagatatgg aaaccacagg gatcactgtc 2640
cctgatcaaa cccttgacaa ggtcatatct gctagacaat ctaatgagag tccgcggtct 2700
gagcctgaag agccagcatc aacagtaagc tcttagttat catatcctct tctgcttgtt 2760
gtgaagtctc tataagaaac agaaatcggt agaaggagct gaatctgtct tagttatgaa 2820
agttttgttc attataagta caagtcatgt agttccgagt gtagaacagt ttttactagt 2880
gttgcaccag gtccctccag tctgatactt aattctttag tgttggatct ttctatataa 2940
gaaaaaaaaa aaaaaaaaa 2959



<210> 66 

<211> 1295 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 66 

aagcttcgaa gtcgatttca atggaaggtt cctcgtcagc catcgcgagg aagacatggg 60
agctagagaa caacattctc ccagtggaac caaccgattc agcctccgac agtatattcc 120
actacgacga cgcttcacaa gccaaaatcc agcaggagaa gccatgggcc tccgatccta 180
actacttcaa gcgcgttcac atctcagccc ttgctcttct caagatggtg gttcacgctc 240
gctccggtgg cacaatcgag atcatgggtc ttatgcaggg taaaaccgag ggtgatacaa 300
tcatcgttat ggatgctttt gctttgcctg ttgaaggtac tgagactagg gttaatgctc 360
agtctgatgc ctatgagtat atggttgaat actctcagac cagcaagctg gctgggaggt 420
tggagaacgt tgttggatgg tatcactctc accctgggta tggatgttgg ctctcgggta 480
ttgatgtttc gacacagatg cttaaccaac agtatcagga gccattctta gctgttgtta 540
ttgatccaac 
cagagggaca 
agattgagga 
agtcatctct 
tttcttcttc 
ctgagaagct 
cagccggtca 
gtgcaaagat 
tgttcaattc 
75 80

Val Tyr Tyr Ile Gly Thr Val Ile Pro Ile Ala Val Leu Leu Leu Ser

85 90 95

Tyr Leu Val Pro Val Lys Pro Val Arg Pro Lys Thr Arg Lys Glu Glu

100 105 110



<210> 81 

<211> 119 

<212> PRT 

<213> Arabidopsis thaliana
<220> 

<221> VARIANT 

<222> 97,98,113 

<223> Xaa = any amino acid 

<400> 81 



Val Phe Glu Tyr Met Asp Thr Asp Val Lys Lys Phe Ile Arg Ser Phe

1 5 10 15

Arg Ser Thr Gly Lys Asn Ile Pro Thr Gln Thr Ile Lys Ser Leu Met

20 25 30

Tyr Gln Leu Cys Lys Gly Met Ala Phe Cys His Gly His Gly Ile Leu

35 40 45

His Arg Asp Leu Lys Pro His Asn Leu Leu Met Asp Pro Lys Thr Met

50 55 60

Arg Leu Lys Ile Ala Asp Leu Gly Leu Ala Arg Ala Phe Thr Leu Pro

65 70 75 80

Met Lys Lys Tyr Thr His Glu Ile Leu Thr Leu Trp Tyr Arg Ala Pro

85 90 95

Xaa Xaa Ser Ser Trp Cys His Pro Leu Leu Tyr Ser Cys Gly Tyr Val

100 105 110

Xaa Cys Trp Leu His Ile Cys

115



<210> 82
<211> 296
<212> PRT

<213> Arabidopsis thaliana



<400> 82 



Pro 


Lys 


Arg 


Arcr 


Met 


Ser 


Met 


Glu 


Met 


Glu 


Leu 


Phe 


Val 


Thr 


Pro 


Glu 


1 




5 










10 










15 




Lys 


Gin Arg 


Gin 


His 


Pro 


Ser 


Val 


Ser 


Val 


Glu 


Lys 


Thr 


Pro 


Val 


Arg 






20 










25 










30 






Arc 


Lys 


Leu 
35 


lie 


Val 


Asp 


Asp 


Asp 
40 


Ser 


Glu 


He 


Gly 


Ser 
45 


Glu 


Lys 


Lys 


Glv 


Gin 


Ser 


Arcr 


Thr 


Ser 


Gly 


Gly Gly Leu Arg 


Gin 


Phe 


Ser 


Val 


Met 




50 










55 










60 










Val 


Cys 


Gin 


Lys 


Leu 


Glu 


Ala 


Lys 


Lys 


He 


Thr 


Thr 


Tyr 


Lys 


Glu 


Val 


65 










70 




















80 


Ala 


Asp 


Glu 


lie 


lie 
85 


Ser 


Asp 


Phe 


Ala 


Thr 
90 


Tip 
IXC 


Lys 


Gin 


Asn 


Ala 
95 


Glu 


jj y o 


Pro 


Leu 


Asn 


Glu 


Asn 


Glu 


Tyr Asn 


Glu 


Lys 


Asn 


lie 


Arg Arg Arg 








100 










105 










110 






Val


Tyr Asp 


Ala 


Leu 


Asn 


Val 


Phe 


Met 


Ala 


Leu 


Asp 


Ile


Ile


Ala 


Arg 






115 










120 










125 








nop 


Lys 


Lys 


Glu 


Ile


Arg




Lys 


Gly Leu 


rIO 


Ile


Thr 


Cys 


Lys 


Lys 




130 










135 










140 












Val 


Glu 


Glu 


Val 


Lys 


Met 


Asp 


Arg 


Asn 


Lys 


Val 


Met 


Ser 


Ser 


Val 


145 










150 










155










160 


oJ-ii 


Lys 


Lys 


an » 

/~ix a 


nlo 

165 




T.oi 1 


Lys 


Glu 


Leu 
170 


Arg 


Glu 


Lys 


Val 


Ser 
175 


Ser 




Glu 


Ser 


T.on 

180 






.rix. y 


Asn 


Gln
185 


Glu 


Met 


Val 


Val 


Lys 
190 


Thr 


Gln




Pro 


Ala 


Glu 


r^l v 
uxy 


Phe 


Thr 


Leu 


Pro 


Phe 


Ile


Leu 


Leu


Glu 


Thr 


Asn 




195 










200 










205 








Pro 


His 
210 


Ala 


V d-L 


V &X 




215 


Glu 


Ile


Ser 


Glu 


220 


Met 


Gln


Leu 


Val 


His 


Leu 


Asp 


Phe 


Asn 


Ser 


Thr 


Pro 


Phe 


Ser 


Val 


His 


Asp 


Asp Ala 


Tyr 


225 










230 










235 










240 


Ile


Leu 


Lys 


Leu 


Met 
245 


Gln


Glu 


Gln


Lys 


Gln
250 


Glu 


Gln


Asn 


Arg 


Val 
255 


Ser 


Ser 


Ser 


Ser 


Ser 
260 


Thr 


His 


His 


Gln


Ser 
265 


Gln


His 


Ser 


Ser 


Ala 
270 


His 


Ser 


Ser 


Ser 


Ser 


Ser 


Cys 


Ile


Ala 


Ser 


Gly Thr 


Ser 


Gly 


Pro 


Val 


Cys 


Trp 






275 










280 










285 








Asn 


Ser 
290 


Gly 


Ser 


Ile


Asp 


Thr 
295 


Arg



















<210> 83 
<211> 173 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 83 

Met Gln Pro Thr Glu Thr Ser Gln Pro Ala Pro Ser Asp Gln Gly Arg

1 5 10 15

Arg Leu Lys Asp Gln Leu Ser Glu Ser Met Ser Phe Ser Ser Gln Met

20 25 30

Lys Lys Glu Asp Asp Glu Leu Ser Met Lys Ala Leu Ser Ala Phe Lys

35 40 45

Ala Lys Glu Glu Glu Ile Glu Lys Lys Lys Met Glu Ile Arg Glu Arg
50 55 60
Val


Gln


Ala Gln


Leu 


Gly 


Arg 


Val 


Glu 


Asp 


Glu 


Ser 


Lys 


Arg 


Leu 


Ala 


65 








70 










75 










80 


Met 


Ile


Arg Glu 


Glu 
85 


Leu 


Glu 


Gly 


Phe 


Ala 
90 


Asp 


Pro 


Met 


Arg 


Lys 
95 


Glu 


Val 


Thr 


Met Val 


Arg 


Lys 


Lys 


Ile


Asp 


Ser Leu 


Asp 


Lys 


Glu 


Leu 


Lys 






100 










105 










110 






Pro 


Leu 


Gly Asn 
115 


Thr 


Val 


Gln


Lys 
120 


Lys 


Glu 


Thr 


Glu 


Tyr 
125 


Lys 


Asp 


Ala 


Leu 


Glu 
130 


Ala Phe 


Asn 


Glu 


Lys 
135 


Asn 


Lys 


Glu 


Lys 


Val 
140 


Glu 


Leu 


Ile


Thr 


Lys 


Leu 


Gln Glu


Leu 


Glu 


Gly 


Glu 


Ser 


Glu 


Lys 


Phe 


Arg 


Phe 


Lys 


Lys 


145 








150 










155 










160 


Leu 


Glu 


Glu Leu 


Ser 
165 


Lys 


Asn 


Ile


Asp 


Leu 
170 


Thr 


Lys 


Pro 









<210> 84 
<211> 46 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 84 

Gln Lys Gln Ala Pro Gly Ala Gly Asp Val Pro Ala Thr Ile Gln Glu

1 5 10 15

Glu Asp Asp Asp Asp Asp Val Pro Asp Leu Val Val Gly Glu Thr Phe 

20 25 30 

Glu Thr Pro Ala Thr Glu Glu Ala Pro Lys Ala Ala Ala Ser 
35 40 45 



<210> 85 

<211> 383 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 85 



Met 


Glu 


Asp 


Asp 


Asp 


Glu 


Ile


Gln


1 








5 








Ser 


Leu 


Ser 


Pro 


Gln


Ala 


Pro 


Pro 








20 










Asp 


Val 


Thr 


Val 


Ala 


Val 


Val 


Lys 






35 










40 


Gln


Ser 


Pro 


Ser 


Met 


Asn 


Ala 


Leu 




50 










55 




Val 


Thr 


Gly 


Gly 


Gly 


Gly 


Ser 


Gly 


65 










70 






Gly 


Gly 


Ser 


Gly 


Gly 


Gly 


Gly 


Gly 










85 








Glu 


Ala 


Thr 


Lys 


Val 


Leu 


Ile


Glu 








100 










Pro 


Gly 


Lys 


Gly 


Thr 


Leu 


Lys 


Gln






115 










120 


Ile


Val 


Asn 


Lys 


Ser 


Arg 


Gln


Cys 




130 










135 




Cys 


Lys 


Asn 


Arg 


Ile


Asp 


Thr 


Val 


145 










150 






Ala 


Lys 


Ile


Ala 


Ser 


Gly 


Asp 


Gly 



Ser 


Ile


Pro 


Ser 


Pro 


Gly 


Asp 


Ser 




10 










15 




Ser 


Pro 


Pro 


Ile


Leu 


Pro 


Thr 


Asn 


25 










30 






Lys 


Pro 


Gln


Pro 


Gly 


Leu 


Ser 


Ser 










45 








Ala 


Leu 


Val 


Val 


His 


Thr 


Pro 


Ser 








60 










Asn 


Arg 


Asn 


Gly 


Arg 


Gly 


Gly 


Gly 






75 










80 


Gly Arg Asp 


Asp 


Cys 


Trp 


Ser 


Glu 




90 










95 




Ala 


Trp 


Gly Asp 


Arg 


Phe 


Ser 


Glu 


105 










110 






Gln


His 


Trp 


Lys 


Glu 


Val 


Ala 


Glu 










125 








Lys 


Tyr 


Pro 


Lys 


Thr 


Asp 


Ile


Gln








140 










Lys 


Lys 


Lys 


Tyr 


Lys 


Gln


Glu 


Lys 






155 










160 


Pro 


Ser 


Lys 


Trp 


Val 


Phe 


Phe 


Lys 
165










170 










175 




Lys 


Leu 


Glu Ser 


Leu 


Ile


Gly 


Gly 


Thr 


Thr 


Thr 


Phe 


Ile


Ala 


Ser 


Ser 




180 










185 










190 






Lys 


Ala 


Ser Glu 


Lys 


Ala 


Pro 


Met 


Gly 


Gly Ala 


Leu 


Gly 


Asn 


Ser 


Arg 




195 








200 










205 








Ser 


Ser 


Met Phe 


Lys 


Arg Gln


Thr 


Lys 


Gly Asn 


Gln


Ile


Val 


Gln


Gln




210 








215 










220 










Gln


Gln


Glu Lys 


Arg 


Gly 


Ser 


Asp 


Ser 


Met 


Arg 


Trp 


His 


Phe Arg Lys 


225 






230 










235 










240 


Arg 


Ser 


Ala Ser 


Glu 


Thr 


Glu 


Ser 


Glu 


Ser 


Asp 


Pro 


Glu 


Pro 


Glu 


Ala 






245 










250 










255 




Ser 


Pro 


Glu Glu 


Ser 


Ala 


Glu 


Ser 


Leu 


Pro 


Pro 


Leu 


Gln


Pro 


Ile


Gln






260 










265 










270 






Pro 


Leu 


Ser Phe 


His 


Met 


Pro 


Lys 


Arg 


Leu 


Lys 


Val 


Asp 


Lys 


Ser 


Gly 






275 








280 










285 








Gly 


Gly 


Gly Ser 


Gly 


Val 


Gly 


Asp 


Val 


Ala 


Arg 


Ala 


Ile


Leu 


Gly 


Phe 


290 








295 










300 










Thr 


Glu 


Ala Tyr 


Glu 


Lys 


Ala 


Glu 


Thr 


Ala 


Lys 


Leu 


Lys 


Leu 


Met 


Ala 


305 






310 










315 










320 


Glu 


Leu 


Glu Lys 


Glu 


Arg 


Met 


Lys 


Phe 


Ala 


Lys 


Glu 


Met 


Glu 


Leu 


Gln






325 










330 










335 




Arg 


Met 


Gln Phe


Leu 


Lys 


Thr 


Gln


Leu 


Glu 


Ile


Thr 


Gln


Asn 


Asn 


Gln




340 










345 










350 






Glu 


Glu 


Glu Glu 


Arg 


Ser 


Arg 


Gln


Arg 


Gly 


Glu 


Arg 


Arg 


Ile


Val 


Asp 






355 








360 










365 








Asp 


Asp 


Asp Asp 


Arg 


Asn Gly 


Lys 


Asn Asn Gly 


Asn 


Val 


Ser 


Ser 




370 








375 










380 











<210> 86 

<211> 131 

<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<221> VARIANT 

<222> 70,118 

<223> Xaa = any amino acid 



<400> 86 



Gly 


Thr 


Ser 


Leu 


Leu 


Leu 


His 


Ala 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ile


Ser 


1 








5 










10 










15 




Leu 


Thr 


Ile


Pro 
20 


Ser 


Asn 


His 


Ser 


Ser 
25 


Met 


Ala 


Thr 


Val 


Ser 
30 


Ser 


Ser 


Ser 


Trp 


Pro 


Asn 


Pro 


Asn 


Pro 


Asn 


Pro 


Asp 


Ser 


Thr 


Ser 


Ala 


Ser 


Asp 




35 










40 










45 








Ser 


Asp 


Ser 


Thr 


Phe 


Pro 


Ser 


His Arg Asp Arg 


Val 


Asp 


Glu 


Pro 


Asp 




50 










55 










60 










Ser 


Leu 


Asp 


Ser 


Phe 


Xaa 


Ser 


Met 


Ser 


Leu 


Asn 


Ser 


Asp 


Glu 


Pro 


Asn 


65 








70 










75 










80 


Gln


Thr 


Ser 


Asn 


Gln
85 


Ser 


Pro 


Leu 


Ser 


Pro 
90 


Pro 


Thr 


Pro 


Asn 


Leu 
95 


Pro 


Val 


Met 


Pro 


Pro 


Pro 


Phe 


Val 


Leu 


Tyr 


Leu 


Ser 


Phe 


Asn 


Gln Asp


His 








100 










105 










110 






Ala 


Cys 


Phe 


Ala 


Cys 


Xaa 


His 


Phe 


Val 


Pro 


Ser 


Leu 


Ser 


Leu 


Tyr 


Leu 




115 










120 










125 








Ser 


Ala 


Thr 
130



<210> 87 

<211> 181 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 87 



Gln


Ala 


His 


Asp 


Ser 


Arg 


Ile


Ala 


Cys 


Phe 


Ala 


Leu 


Thr 


Gln


Asp 


Gly 


1 








5 










10 










15 




His 


Leu 


Leu 


Ala 


Thr 


Ala 


Ser 


Ser 


Lys 


Gly Thr 


Leu Val Arg 


Ile


Phe 








20 










25 










30 






Asn 


Thr 


Val 


Asp 


Gly 


Thr 


Leu 


Arg 


Gln


Glu 


Val 


Arg Arg Gly Ala Asp 






35 










40 










45 








Arg 


Ala 
50 


Glu 


Ile


Tyr 


Ser 


Leu 
55 


Ala 


Phe 


Ser 


Ser 


Asn 
60 


Ala 


Gln


Trp 


Leu 


Ala 


Val 


Ser 


Ser 


Asp 


Lys 


Gly 


Thr 


Val 


His 


Val 


Phe 


Gly Leu 


Lys 


Val 


65 










70 










75 










80 


Asn 


Ser 


Gly 


Ser 


Gln
85 


Val 


Lys 


Asp 


Ser 


Ser 
90 


Arg 


Ile


Ala 


Pro 


Asp 
95 


Ala 


Thr 


Pro 


Ser 


Ser 


Pro 


Ser 


Ser 


Ser 


Leu 


Ser 


Leu 


Phe 


Lys 


Gly Val 


Leu 








100 










105 










110 






Pro 


Arg 


Tyr 
115 


Phe 


Ser 


Ser 


Glu 


Trp 
120 


Ser 


Val 


Ala 


Gln


Phe 
125 


Arg 


Leu 


Val 


Glu 


Gly 
130 


Thr 


Gln


Tyr 


Ile


Ala 
135 


Ala 


Phe 


Gly 


His 


Gln
140 


Lys 


Asn 


Thr 


Val 


Val 


Ile


Leu 


Gly 


Met 


Asp 


Gly 


Ser 


Phe 


Tyr 


Arg 


Cys 


Gln


Phe 


Asp 


Pro 


145 










150 










155 










160 


Val 


Asn 


Gly 


Gly 


Glu 
165 


Met 


Ser 


Gln


Leu 


Glu 
170 


Tyr 


His 


Asn 


Cys 


Leu 
175 


Lys 


Pro 


Pro 


Ser 


Val 


Phe 

























180 



<210> 88 

<211> 175 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 88 



Met 


Asp 


Asp 


Ser 


Glu 


Glu 


Asp 


Gln Arg


Leu 


Pro 


His 


His 


Lys 


Asp 


Pro 


1 








5 










10 










15 




Lys 


Glu 


Phe 


Val 


Ser 


Leu 


Asp 


Lys 


Leu 


Ala 


Glu 


Leu 


Gly Val 


Leu 


Ser 








20 










25 










30 






Trp 


Arg 


Leu 
35 


Asp 


Ala 


Asp 


Asn 


Tyr 
40 


Glu 


Thr 


Asp 


Glu 


Asp 
45 


Leu 


Lys 


Lys 


Ile


Arg 
50 


Glu 


Ser 


Arg 


Gly 


Tyr 
55 


Ser 


Tyr 


Met 


Asp 


Phe 
60 


Cys 


Glu 


Val 


Cys 


Pro 


Glu 


Lys 


Leu 


Pro 


Asn 


Tyr 


Glu 


Val 


Lys 


Val 


Lys 


Ser 


Phe 


Phe 


Glu 


65 










70 










75 










80 


Glu 


His 


Leu 


His 


Thr 
85 


Asp 


Glu 


Glu 


Ile


Arg 
90 


Tyr 


Cys 


Val 


Ala 


Gly 
95 


Thr 


Gly 


Tyr 


Phe 


Asp 


Val 


Arg 


Asp 


Arg Asn 


Glu 


Ala 


Trp 


Ile


Arg 


Val 


Leu 








100 










105 










110 






Val 


Lys 


Lys 


Gly 


Gly 


Met 


Ile


Val 


Leu 


Pro 


Ala 


Gly 


Ile


Tyr 


His 


Arg 



115 120 125 
Phe Thr Val Asp Ser Asp Asn Tyr

130 135 
Gly Glu Pro Val Trp Thr Pro Tyr 
145 150 
Ala Arg Lys Glu Tyr Val Asp Asn 
165 



Ile Lys Ala Met Arg Leu Phe Val
140 

Asn Arg Pro His Asp His Leu Pro 
155 160 
Phe Met Ile Asn Ala Ser Ala
170 175 



<210> 89 








<211> 98 








<212> PRT 








<213> Arabidopsis thaliana 


<400> 89 








Thr Ser 


Phe 


Pro 


Ile


Thr Arg Lys 


1 






5 




Asp Ser 


Glu 


Asp 


Thr 


Lys Gln Ser






20 






Gln Asn


Leu 


Leu 


Gln


Gln Met Gln




35 






40 


Ser Ile


Ile


Thr 


Lys 


Ile Asp Asp


50 








55 


Glu Gln


Ser 


Ile


Asn 


Asp Leu Arg 


65 








70 


Pro Pro 


Pro 


Ala 


Ser 


Lys Ser Gly 








85 




Ser Ser 











Lys Thr Leu Lys Met Asp Gly His 

10 15 
Thr Ala Asp Met Thr Ala Phe Val 
25 30 
Thr Arg Phe Gln Thr Met Ser Asp
45

Met Gly Gly Arg Ile Asn Glu Leu
60 

Ala Glu Met Gly Val Glu Gly Thr 

75 80 
Asp Glu Pro Lys Thr Pro Ala Ser 
90 95 



<210> 90 

<211> 117 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 90 



Ala 


Gln


Val 


Arg 


Ala 


Lys 


Met 


Leu 


Lys 


Glu 


Val 


Ala 


Thr 


Glu 


Lys 


Gln


1 






5 








10 










15 




Thr 


Ala 


Val 


Asp 


Thr 


His 


Phe 


Ala 


Thr 


Ala 


Lys 


Lys 


Leu 


Ala 


Gln


Glu 








20 










25 










30 




Ala 


Gly 


Asp 


Ala 


Leu 


Phe 


Val 


Lys 


Ile


Phe 


Ala 


Ile


Lys 


Lys 


Leu 


Leu 


35 










40 










45 






Thr 


Lys 


Leu 


Glu 


Ala 


Glu 


Lys 


Glu 


Ser 


Val 


Asp 


Gly 


Lys 


Phe 


Lys 


Glu 


50 










55 










60 








Glu 


Val 


Lys 


Glu 


Leu 


Ser 


His 


Leu 


Leu 


Ala 


Asp 


Ala 


Ser 


Glu 


Ala 


Tyr 


65 








70 










75 










80 


Glu 


Tyr 


His 


Gly 


Ala 


Val 


Arg 


Lys 


Ala 


Lys 


Asp 


Glu 


Gln


Ala 


Ala 


Glu 






85 










90 










95 




Glu 


Phe 


Ala 


Lys 
100 


Glu 


Ala 


Thr 


Gln


Ser 
105 


Ala 


Glu 


Ile


Ile


Trp 
110 


Val 


Lys 


Phe 


Leu 


Ser 
115 


Ser 


Leu 

























<210> 91 
<211> 216 
<212> PRT 
<213> Arabidopsis thaliana

<400> 91












Met 


Glu 


Phe 


Gly 


Ser 


Phe 


Leu 


Val 


1 








5 








Phe 


Val Ile


Leu 


Met 


Leu 


Leu 


Phe 








20 










Asn 


Ala 


Pro 


Ile


Tyr Tyr 


Pro Asn






35 










40 


Trp 


Glu 


Gly Thr 


Ser 


Leu 


Thr Arg 




50 










55 




Ala 


Leu 


Thr 


Ser 


Ser 


Glu 


Gln Asp


65 










70 






Thr 


Ala 


Val 


His 


Phe 
85 


Val 


Phe 


Leu 


Cys 


Ser 


Ser 


Leu 
100 


Leu 


Leu 


Leu 


Pro 


Asp 


Asn 


Asn 
115 


Ile


Lys 


Asn 


Thr 


Lys 
120 


Gly 


Thr 
130 


Phe 


Ser 


Gln


Leu 


Asp 
135 


Asn 


Lys 


Ser 


Ser Arg 


Leu 


Trp 


Ala 


Phe 


145 










150 






Leu 


Val 


Thr 


Tyr 


Phe 
165 


Phe 


Leu 


Trp 


Leu 


Arg 


Ala 


Gln
180 


Ala 


Leu 


Met 


Ser 


Ala 


Ile


Leu 
195 


Val 


Arg 


Asp 


Met 


Pro 
200 


Arg 


Glu 
210 


Phe 


Gln


Ile


Tyr 


Glu 
215 


Ser 



Ser 


Leu 

10


Gly 


Thr 


Ser 


Phe 


Val 
15 


Ile


Thr 


Trp 


Leu 


Ser Arg 


Lys 


Ser 


Gly 
Arg


Ile Leu Lys Gly


Leu 


Glu 


Pro 
Pro


Phe 


Ala Trp 


Met 


Arg 


Glu 
Val


Val 


Asn 


Leu


Ser 


Gly Val Asp 
Ser


Thr 
90 


Val 


Leu 


Gly 


Ile


Phe 
Ala


Thr 


Leu 


Leu 


Pro 


Leu 


Ala 


Ala 


Thr 
110






Asn 


Ala 


Thr 


Asp 


Thr 
Thr


Ser 


Lys 


Leu 


Ser 


Met 


Ala 
140 


Asn 


Ile


Thr 


Lys 


Leu 


Gly Ala Val 


Tyr 


Trp 


Ile


Ser 
Lys


Ala 
Tyr


Lys 


His 


Val 


Ser 
Ala
Val


Lys 


Pro 


Glu 


Gln


Phe 
Arg Arg


Gly 
<212> PRT

<213> Arabidopsis thaliana 
Ala
Val
Gln


Leu 


Glu 


Leu 
40 


Trp 


Met 
50 


Ala 


Gly 


Val 


Gly 


Ala 
55 


Pro 


Gly 


Gln


Leu 


Val 


Ser 


Glu 


Tyr Ala 
Gln


His 


Leu 


Lys 


Asp 
85 


Ile


Ala 


Gly 


Asp 


Ala 


Gly 


Gln
100 


Val 


Ala 


Lys 


Leu 


His 


Lys 


Arg 
115 


Arg 


Lys 


Ile


Leu 


Gln
120 


Phe 


Thr 
130 


Leu 


Glu 


Glu 


Gly 


Ser 
135 


Ser 



Gln


Arg 


Gln


Ile Arg


Ser 


Trp 


Leu 




10 








15 




Val 


Thr 


Ala 


Asp Asp 


Val 


Ser 


Gly 


25 
Leu


Ser 


Thr 


Ala Ile


Met 


Asp 


Gly 
Val


Pro 


Pro 


His Thr 


Asp 


Ala 


Leu 
Lys


Arg 


Val 


Tyr Thr 


Ser 


Gln


Met 
80


Thr 


Leu 


Ala 


Ser Glu 


Glu 


Ala 


Glu 
Arg


Ser 


Ala 


Leu Glu 
Val


Asp 
Gln


Met 
Ser Asp
Ala


Leu 
Pro


Val 


Gln


Asn Pro 


Ser 


Thr 


Ala 



140 
Ala


Glu 


Asp 


Ser Arg 


Leu 


Ala 


Ser 


Leu 


Ile


Ser 


Leu 


Asp Ala 


Ile Leu


145 










150 










155 








160 


Lys 


Gln


Val 


Lys


Glu 


Ile


Thr 


Arg 


Gln


Ala 


Ser 


Val 


His 


Val 


Leu Ser 








165 










170 










175 


Lys


Ser 


Lys 


Lys


Lys


Ala 


Leu 


Leu 


Glu 


Ser 


Leu 


Asp 


Glu 


Leu 


Asn Glu 






180 










185 










190 




Arg


Met 


Pro 


Ser 


Leu 


Leu 


Asp 


Val 


Asp 


His 


Pro 


Cys 


Ala 


Gln Arg Glu




195 










200 










205 






Ile


Asp 
210 


Thr 


Ala 


His 


Gln


Leu 
215 


Val 


Glu 


Thr 


Ile


Pro 
220 


Glu 


Gln


Glu Asp 


Asn 


Leu 


Gln


Asp 


Glu 


Lys 


Arg 


Pro 


Ser 


Ile


Asp 


Ser 


Ile


Ser 


Ser Thr 


225 










230 










235 








240 


Glu 


Thr 


Asp 


Val 


Ser 


Gln


Trp 


Asn 


Val 


Leu 


Gln


Phe 


Asn 


Thr 


Gly Gly 








245 










250 
Ser


Ser 


Ala 


Pro 


Phe 


Ile


Ile


Lys 


Cys 


Gly Ala 


Asn 


Ser 


Asn 


Ser Glu 
265
Leu


Val 


Ile
Lys
Ala


Arg 
Ile


Gln


Glu 


Pro 


Lys 
285 


Gly 


Gly Glu 


Ile


Val 


Arg 


Val 


Val 


Pro 


Arg 


Pro 


Ser 


Val 


Leu 


Glu 


Asn 


Met 


Ser Leu 




290 
300








Glu 


Glu 


Met 


Lys 


Gln


Val 


Phe 


Gly Gln


Leu 


Pro 


Glu 


Ala 


Leu 


Ser Ser 
315








320 


Leu 


Ala 


Leu 


Ala 


Arg 
Thr


Ala 


Asp 
Pro
Glu
Asn


Asp 


Asp 
Met


Gln


Pro 
Met


Tyr 
Met
His
Pro


Ser 


Tyr 
Thr
Ser Leu


Gly Val 


Glu 
50








55 




Thr


Asn 


Glu Met 


Leu 


Gln


Ser 


Ile


65 








70 






Ser 


Leu 


Val Arg 


Arg 


Val 


Thr 


Val 








85 








Lys 


Glu 


Arg Ala 


Leu 


Leu 


Gly Glu






100 










Gln


Ile


Glu Asn Leu Ser 


Val 


Lys 






115 








120 


Tyr 


Gly


Thr Asn 


Ser 


Val 


Leu 


Asn 




130 








135 




Leu 


Val 


Glu Glu 


Thr 


Met 
<210> 95
<212> PRT

<213> Arabidopsis thaliana 
Asp
20 


Glu 


Tyr 


Met 


Asn 


Ser 


Pro 
Ser
Ala


Lys 
Arg
50 


Arg 
Thr


Thr 


Thr 
Val


Ser 


Arg 


Ser 
Arg


Ser 
Met


Asp 
Glu


Asn 
85 
Asn


Lys 


Ser 
Val


Lys 
Phe


Glu 


Ala 


Asn 


Ser 


Thr 


Arg 
Ser


Ala 


Ala 
120 
Val
Gln


Ser 


Val 
Ser
135 
Lys


Cys 


Met 


Ala 


Asp 
Ser


Leu 
Ala
Thr
<212> PRT

<213> Arabidopsis thaliana 
Val


Asp 


Asn 
Gln


Ile


Gln


His 
Lys


Met 


Asp 


Arg 
Arg


Lys 


Gly 


Asn 
Val
Ala
Glu


Arg 


Ser 
Leu


Ile


Leu 
Leu


Gln


Arg 


Lys 




130 






Leu 


Gln
Gly
Lys


Arg 
Arg


Arg 
Glu
Arg


Arg 


Arg 


Lys 


105 








Cys 


Pro 


Arg 


Met 


Gly 
Ile








140 


Ile


Ser 


Phe 
Ser
Arg
Gly Val


Asp 


Met 
Gly
Val
Gln


Met 
Glu
Met
Pro


Leu 


Asn 


Arg Gly Val 
<210> 97
<212> PRT

<213> Arabidopsis thaliana 
Arg
Glu


Leu 
Pro


Thr 
Ala


Gln


Leu 
Ser
Pro


Phe 
Phe


Asn 










85 








His 
Lys


Thr 


Glu 
Phe


Asp 
Val


Leu 


Asp 


Lys 


Glu 
Gly






115 










120 
Thr
Ile


Gly 


Glu 




130 










135 




Glu 


Trp 


Ile


Lys 


Glu 


Val 


Asp 


Val 


145 
Glu


Asp 


Phe 


Ile


Ala 


Arg 


Met 


Val 
Pro
Asp


Leu 
Lys


Lys 


Asp 
Leu
Asp


Asp 


Gln
Met


Leu 


Phe 


Asp 


Thr 


Asp 


Gly Asp 
Leu


Met 
Arg


Ser 


Leu 
Ile


Ala 
Glu


Asn 
Arg


Phe 
Leu


Asp 


Leu 


Met 


Ala 
Lys


Arg 


Gln


Leu 


Arg 


Asp 


Ala 


Phe 
110
Ala
Arg


Lys 


Leu 


Glu 
1 5 10 15

Ser Gln Arg Met Arg Ser Ile Lys Glu Ser Lys Glu Val Leu Asp Thr

20 25 30

Glu Ser Arg Ser Arg Leu 
<210> 99
<212> PRT

<213> Arabidopsis thaliana 



<400> 99 
Ser


Thr 
Trp
Thr
Ser


Val 


Ser 


Thr 


Ser 


Gly 


Ser 


Met 


Gly 


Ser 


Pro 


Ser 


Ser 


Arg 
40
Ser


Glu 


Gln


Thr 


Ile


Thr 


Val 


Val 


Thr 


Ser 


Thr 


Ser Asp 


Thr 


Thr 


Phe 
Asp


Ile


Gln
Gln
75
Lys


Lys 


Arg 


Gly 
Ala


Ala 


Gly 
Pro


Asp 


Lys 


Thr 
100 


Gly


Arg 


Gly 


Leu 


Arg 
105 


Gln


Phe 


Ser 


Met 


Lys
110 


Val 


Cys 


Glu 


Lys 


Val 


Glu 


Ser 


Lys 


Gly Arg 


Thr 


Thr 


Tyr 


Asn 


Glu 


Val 


Ala 


Asp 






115 










120 










125 








Glu 


Leu 
130 


Val 


Ala 


Glu 


Phe 


Ala 
135 


Leu 


Pro 


Asn 


Asn 


Asp 
140 


Gly 


Thr 


Ser 


Pro 


Asp 


Gln


Gln


Gln


Tyr


Asp 


Glu 


Lys 


Asn 


Ile


Arg Arg Arg 


Val 


Tyr 


Asp 


145 










150 










155 










160 


Ala 


Leu 


Asn 


Val 


Leu 
165 


Met 


Ala 


Met 


Asp 


Ile
170


Ile


Ser 


Lys 


Asp 


Lys 
175 


Lys 


Glu 


Ile


Gln


Trp
180 


Arg 


Gly


Leu 


Pro 


Arg 
185 


Thr 


Ser 


Leu 


Ser 


Asp 
Ile


Glu 


Glu 


Leu 


Lys 


Asn 


Glu 


Arg 


Leu 


Ser 


Leu 


Arg Asn Arg


Ile


Glu 


Lys 


Lys 
200










205 








Thr 


Ala 


Tyr 


Ser 


Gln


Glu 


Leu 


Glu 


Glu 


Gln


Arg 


Asn 


Glu 


His 


Leu 


Tyr 
220










Ser 


Ser 


Gly 


Asn 


Ala 


Pro 


Ser 


Gly 
Val


Ala 


Leu 


Pro 


Phe 


Ile


Leu 


225 
Thr
245
Val


Glu 
Ile
Glu
Pro


Phe 


Glu 
270 


Leu 


His 


Asp 


Asp 


Asn 
275 


Phe 


Val 


Leu 
Pro
Ser Ala Asp Thr Ala Pro Val Lys
Asn Ser Ser Met Lys Pro Glu Asn
370 375 



Ser Pro Ser Leu Pro Gly Ile Met
365 
Ala


Lys 


Glu 
Val
Lys
Ala
Met
Glu


Asp 


Gly 


Ser 
Glu
Val


Trp 
Trp
135
Glu
<212> PRT

<213> Arabidopsis thaliana 
Glu
Gln
Leu


Asn 
Glu
Met


Val 


Asn 
60 


Arg 


Glu 
Ser


Asp 


Asp 


Lys 


Asp 



Ala Val Ser Arg Pro Phe 
10 

Ser Glu Val Thr Ser Thr 
25 

Arg Pro Ala Leu Glu Asp 
40 

Lys Arg Ala Val Leu Gly 
55 

Ile Leu Glu Ala Lys Asn
70 75 
Gly Leu Ala Ser Thr Ser 
90 

Thr Asp Leu Gln Ser Arg
105 

Thr Ala Gly Asn Leu Ser 
120 

Asn Cys Ile Glu Ile Trp
135 

Arg Ser Ala Ser Thr Ala 
150 155 
Val Pro Asp Ile Pro Lys
170 

Pro Leu Leu Cys Cys Leu 



Thr Arg Ser Leu Ala 
15 

Thr Gln Asn Gln Gln
30 

Thr Arg Ala Thr Gly 
45 

Glu Ile Thr Asn Val
60 

Ser Lys Gln Ile Lys
80 

Gln Leu Ala Thr Ser
95 

Thr Asp Ala Lys Val 
110 

Val Ser Lys Gly Thr 
125 

Asn Ser Arg Leu Pro 
140 

Glu Lys Ser Ala Val 
160 

Phe Val Asp Ile Asp
175 

Tyr Ala Pro Glu Ile
180



His 


Tyr Asn 


Leu 


Arg 


Val 


Ser 


Glu 




195 










.200 


Phe 


Met Glu 


Arg 


lie 


Gin 


Lys 


Asp 




210 








,2 15 




Leu Val Asp 


Trp 


Leu 


Val 


Glu 


Val 


225 








230 






Asp 


Thr Leu 


Tyr 


Leu 


Thr 


Val 


Tyr 








245 








Asn 


Tyr Val 


Gin 


Arg 


Gin 


Gin 


Leu 






260 










Leu 


He Ala 


Ser 


Lys 


Tyr 


Glu 


Glu 




275 










280 


Phe 


Cys Phe 


He 


Thr 


Asp 


Asn 


Thr 




290 








295 




Met 


Glu Asn 


Glh 


Val 


Leu 


Lys 


His 


305 








310 






Thr 


Pro Lys 


Thr 


Phe 


Leu 


Arg 


Arg 








325 








Arg 


Leu Ser 


Pro 


Ser 


Leu 


Glu 


Val 






340 










Glu 


Leu Thr 


Leu 


He 


Asp 


Tyr 


His 




355 










360 


Val 


Ala Ala 


Ser 


Ala 


Val 


Phe 


Leu 




370 








375 




Asn 


His Pro 


Trp 


Asn 


Pro 


Thr 


Leu 


385 








390 






Ser 


Asp Leu 


Lys 


Ala 


Ser 


Val 


His 








405 








Thr 


Lys Gly 


Cys 


Pro 


Leu 


Ser 


Ala 






420 










Lys 


Tyr Lys 


Ser 


Val 


Ala 


Val 


Leu 



435 440 



Leu Phe 
450 



185 








190 






Leu Lys 


Arg 


Arg 


Pro 


Leu 


Pro 


Asp 








205 








Val Thr 


Gin 


Ser 


Met 


Arg 


Gly 


lie 






,22>0 










Ser Glu 


Glu 


Tyr Thr 


Leu 


Ala 


Ser 




235 










240 


Leu lie 


.Asp 


Trp 


Phe 


Leu 


His 


Gly 


250 










.255 




Gin Leu 


Leu 


Gly 


He 


Thr 


Cys 


Met 


265 








270 






He Ser 


Ala 


Pro 


Arg 


He 


Glu 


Glu 








285 








Tyr Thr 


Arg 


Asp 


Gin 


Val 


Leu 


Glu 






300 










Phe Ser 


Phe 


Gin 


He 


Tyr 


Thr 


Pro 




315 










320 


Phe Leu 


Arg 


Ala 


Ala 


Gin 


Ala 


Ser 


330 










335 




Glu Phe 


Leu 


Ala 


Ser 


Tyr 


Leu 


Thr 


345 








350 






Phe Leu 


Lys 


Phe 


Leu 


Pro 


Ser 


Val 








365 








Ala Lys 


Trp 


Thr 


Met 


Asp 


Gin 


Ser 






380 










Glu His 


Tyr 


Thr 


Thr 


Tyr 


Lys 


Ala 




395 










400 


Ala Leu 


Gin 


Asp 


Leu 


Gin 


Leu 


Asn 


410 










415 




He Arg 


Met 


Lys 


Tyr 


Arg 


Gin 


Glii 


425 








430 






Thr Ser 


Pro 


Lys 


Leu 


Leu 


Asp 


Thr 



<210> 108 
<211> 901 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 108 

Met Ala Asn Asn Pro Pro Gln Ser 
1 5 

Val Pro Ala Ala Ser Gln Pro Phe 
20 

Asn Val Gln Ser Gln Pro Pro Gln 
35 40 

Gln Leu Phe Pro Val Arg Pro Gly 
50 55 

Ser Gln Ala Val Ser Val Pro Tyr 
65 70 

Ser Gly Ser Thr Gln Pro Gln Pro 
85 

Ala Thr Ser Gly Pro Pro Phe Ser 
100 
100 



Ser Gly 


Thr 


Gin Gly Gin 


His 


Phe 




10 










15 




His 


Pro 


Tyr 


Gly 


His 


Val 


Pro 


Pro 


25 










30 






Tyr 


Ser 


Gin 


Pro 


He 


Gin 


Gin 


Gin 










45 








Gin 


Pro 


Val 


His 


He 


Thr 


Ser 


Ser 








60 










He 


Gin 


Thr 


Asn 


Lys 


He 


Leu 


Thr 






75 










80 


Asn 


Ala 


Pro 


Pro 


Met 


Thr 


Gly 


Phe 




90 










95 




Ser 


Pro 


Tyr 


Thr 


Phe 


Val 


Pro 


Ser 


105 










110 
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Ser 


Tyr 


Pro 


Gin 


Gin 


Gin 


Pro 


Thr 


Ser 


Leu 


Val 


Gin 


Pro 


Asn 


Ser 


Gin 




115 










120 










125 








Met 


His 
130 


Val 


Ala 


Gly 


Val 


Pro 
135 


Pro 


Ala 


Ala 


Asn 


Thr 
140 


Trp 


Pro 


Val 


Pro 


Val 


Asn 


Gin 


Ser 


Thr 


Ser 


Leu 


Val 


Ser 


Pro 


Val 


Gin 


Gin Thr Gly 


Gin 


145 










150 










155 










160 


Gin 


Thr 


Pro 


Val 


Ala 


Val 


Ser 


Thr Asp 


Pro Gly Asn 


Leu 


Thr 


Pro 


Gin 










165 










170 










175 




Ser 


Ala 


Ser 


Asp 


Trp 


Gin 


Glu 


His 


Thr 


Ser 


Ala 


Asp 


Gly Arg 


Lys 


Ala 








180 








185 










190 






Asp Ma 


Ser 


Thr 


Val 


Trp 


Lys 


Glu 


Phe 


Thr 


Thr 


Pro 


Glu 


Gly Lys 


Lys 






195 










200 










205 








Tvr 


Tvr 


Tyr 


Asn 


Lys 


Val 


Thr 


Lys 


Glu 


Ser 


Lys 


Trp 


Thr 


He 


Pro 


Glu 


210 










215 










220 










Asp 


Leu 


Lys 


Leu 


Ala 


Arg 


Glu 


Gin 


Ala 


Gin 


Leu 


Ala 


Ser 


Glu 


Lys 


Thr 


225 








230 










235 










240 


Ser 


Leu 


Ser 


Glu 


Ala 


Gly 


Ser 


Thr 


Pro 


Leu 


Ser 


His 


His 


Ala 


Ala 


Ser 










245 








250 










255 




Ser 


Ser 


Asp 


Leu 


Ala 


Val 


Ser 


Thr 


Val 


Thr 


Ser 


Val 


Val 


Pro 


Ser 


Thr 






260 










265 










270 






Ser 


Ser 


Ala 
275 


Leu 


Thr 


Gly 


His 


Ser 
280 


Ser 


Ser 


Pro 


lie 


Gin 
285 


Ala 


Gly 


Leu 


Ala 


Val 


Pro 


Val 


Thr 


Arg 


Pro 


Pro 


Ser 


Val 


Ala 


Pro 


Val 


Thr 


Pro 


Thr 




290 








295 










300 










Ser 


Gly 


Ala 


lie 


Ser 


Asp 


Thr 


Glu 


Ala 


Thr 


Thr 


Met 


Tyr 


Tyr 


Phe 


Ser 


305 








310 










315 










320 


Leu 


Glv 


Ser 


Phe 


Ala 


Glu 


Asn 


Lys 


Glu 


Met 


Ser 


Val 


Asn 


Gly 


Lys 


Ala 








325 










330 










335 




Asn 


Leu 


Ser 


Pro 


Ala 


Gly 


Asp 


Lys 


Ala 


Asn 


Val 


Glu 


Glu 


Pro 


Met 


Val 








340 








345 










350 






Tvr 


Ala 


Thr 


Lys 


Gin 


Glu 


Ala 


Lys 


Ala 


Ala 


Phe 


Lys 


Ser 


Leu 


Leu 


Glu 




355 








360 










365 








Ser 


Val 
370 


Asn 


Val 


His 


Ser 


Asp 
375 


Trp 


Thr 


Trp 


Glu 


Gin 
380 


Thr 


Leu 


Lys 


Glu 


lie 


Val 


His 


Asp 


Lys 


Arg 


Tyr 


Gly Ala 


Leu Arg 


Thr 


Leu 


Gly 


Glu 


Arg 


385 






390 










395 










400 


Lys 


Gin 


Ala 


Phe 


Asn 


Glu 


Tyr 


Leu 


Gly 


Gin 


Arg 


Lys 


Lys 


Val 


Glu 


Ala 








405 










410 










415 




Glu 


Glu 


Arg 


Arg 
420 


Arg 


Arg 


Gin 


Lys 


Lys 
425 


Ala 


Arg 


Glu 


Glu 


Phe 
430 


Val 


Lys 


Met 


Leu 


Glu 


Glu 


Cys 


Glu 


Glu 


Leu 


Ser 


Ser 


Ser 


Leu 


Lys 


Trp 


Ser 


Lys 






435 








440 










445 








Ala 


Met 
450 


Ser 


Leu 


Phe 


Glu 


Asn 
455 


Asp 


Gin 


Arg 


Phe 


Lys 
460 


Ala 


Val 


Asp 


Arg 


Pro 


Arg 


Asp 


Arg 


Glu 


Asp 


Leu 


Phe Asp Asn 


Tyr 


lie 


Val 


Glu 


Leu 


Glu 


465 




470 










475 










480 


Arg 


Lys 


Glu 


Arg 


Glu 


Lys 


Ala 


Ala 


Glu 


Glu 


His 


Arg 


Gin 


Tyr 


Met 


Ala 




485 










490 










495 




Asp 


Tyr 


Arg 


Lys 


Phe 


Leu 


Glu 


Thr 


Cys 


Asp Tyr 


He 


Lys 


Ala 


Gly 


Thr 




500 










505 










510 






Gin 


Trp 


Arg 
515 


Lys 


He 


Gin 


Asp 


Arg 
520 


Leu 


Glu 


Asp 


Asp 


Asp 
525 


Arg 


Cys 


Ser 


Cys 


Leu 


Glu 


Lys 


He 


Asp 


Arg 


Leu 


He 


Gly 


Phe 


Glu 


Glu 


Tyr 


He 


Leu 


530 










535 










540 










Asp 


Leu 


Glu 


Lys 


Glu 


Glu 


Glu 


Glu 


Leu 


Lys 


Arg 


Val 


Glu 


Lys 


Glu 


His 


545 








550 










555 










560 


Val 


Arg 


Arg 


Ala 


Glu 


Arg 


Lys 


Asn Arg Asp Ala 


Phe 


Arg 


Thr 


Leu 


Leu 
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565 570 575 



Glu 


Glu 


His 


Val 


Ala 


Ala 


Gly He 


Leu 


Thr 


Ala 


Lys 


Thr 


Tyr 


Trp 


Leu 








580 










585 










590 






Asp Tyr Cys lie 


Glu 


Leu 


Lys 


Asp 


Leu 


Pro 


Gin 


Tyr 


Gin 


Ala 


Val 


Ala 






595 










600 










605 








Ser 


Asn 


Thr 


Ser 


Gly 


Ser 


Thr 


Pro 


Lys 


Asp 


Leu 


Phe 


Glu Asp Val 


Thr 




610 










-615 










620 










Glu 


Glu 


Leu 


Glu 


Lys 


Gin Tyr His 


Glu Asp Lys Ser 


Tyr Val 


Lys 


Asp 


625 










630 










635 










640 


Ala 


Met 


Lys 


Ser 


Arg Lys Ala Asn 


Phe 


Lys 


Ser 


Ala 


lie 


Ser 


Glu 


Asp 










645 










650 










655 




Leu 


Ser 


Thr 


Gin 


Gin 


He 


Ser Asp 


lie 


Asn 


Leu 


Lys 


Leu 


He 


Tyr 


Asp 








660 










665 










670 






Asp 


Leu 


Val 
675 


Gly 


Arg 


Val 


Lys 


Glu 
680 


Lys 


Glu 


Glu 


Lys 


Glu 
685 


Ala 


Arg 


Lys 


Leu 


Gin 
690 


Arg 


Leu 


Ala 


Glu 


Glu 
695 


Phe 


Thr 


Asn 


Leu 


Leu 
700 


His 


Thr 


Phe 


Lys 


Glu 


He 


Thr 


Val 


Ala 


Ser 


Asn 


Trp 


Glu Asp 


Ser 


Lys 


Gin 


Leu 


Val 


Glu 


705 










710 










715 










720 


Glu 


Ser 


Gin 


Glu 


Tyr 


Arg 


Ser 


He 


Gly Asp 


Glu 


Ser 


Val 


Ser 


Gin 


Gly 










725 










730 










735 




Leu 


Phe 


Glu 


Glu 
740 


Tyr 


lie 


Thr 


Ser 


Leu 
745 


Gin 


Glu 


Lys 


Ala 


Lys 
750 


Glu 


Lys 


Glu 


Arg 


Lys 
755 


Arg 


Asp 


Glu 


Glu 


Lys 
760 


Val 


Arg 


Lys 


Glu 


Lys 
765 


Glu 


Arg 


Asp 


Glu 


Lys 
770 


Glu 


Lys 


Arg 


Lys 


Asp 
775 


Lys 


Asp 


Lys 


Glu 


Arg 
780 


Arg 


Glu 


Lys 


Glu 


Arg 


Glu 


Arg 


Glu 


Lys 


Glu 


Lys 


Gly 


Lys 


Glu 


Arg 


Ser 


Lys 


Arg 


Glu 


Glu 


785 










790 










795 










800 


Ser 


Asp 


Gly Glu 


Thr 


Ala 


Met 


Asp 


Val 


Ser 


Glu 


Gly 


His 


Lys 


Asp 


Glu 










805 










810 










815 




Lys 


Arg 


Lys 


Gly 


Lys 


Asp Arg Asp 


Arg 


Lys 


His 


Arg Arg Arg 


His 


His 








820 










825 










830 






Asn 


Asn 


Ser 


Asp 


Glu 


Asp 


Val 


Ser 


Ser 


Asp 


Arg 


Asp Asp Arg Asp 


Glu 






835 










840 










845 








Ser 


Lys 
850 


Lys 


Ser 


Ser 


Arg 


Lys 
855 


His 


Gly 


Asn 


Asp 


Arg 
860 


Lys 


Lys 


Ser 


Arg 


Lys 


His 


Ala 


Asn 


Ser 


Pro 


Glu 


Ser 


Glu 


Ser 


Glu 


Asn 


Arg 


His 


Lys 


Arg 


865 










870 










875 










880 


Gin 


Lys 


Lys 


Glu 


Ser 


Ser Arg Arg 


Ser 


Gly Asn Asp Glu Leu 


Glu 


Asp 










885 










890 










895 




Gly 


Glu 


Val 


Gly 


Glu 

























900 



<210> 109 
<211> 358 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 109 

Met Glu Gly Ser Ser Ser Thr Ile Ala Arg Lys Thr Trp Glu Leu Glu 
1 5 10 15 

Asn Ser Ile Leu Thr Val Asp Ser Pro Asp Ser Thr Ser Asp Asn Ile 
20 25 30 

Phe Tyr Tyr Asp Asp Thr Ser Gln Thr Arg Phe Gln Gln Glu Lys Pro 
35 40 45 

Trp Glu Asn Asp Pro His Tyr Phe Lys Arg Val Lys Ile Ser Ala Leu 
50 55 60 

Ala Leu Leu Lys Met Val Val His Ala Arg Ser Gly Gly Thr Ile Glu 
65 70 75 80 

Ile Met Gly Leu Met Gln Gly Lys Thr Asp Gly Asp Thr Ile Ile Val 
85 90 95 

Met Asp Ala Phe Ala Leu Pro Val Glu Gly Thr Glu Thr Arg Val Asn 
100 105 110 

Ala Gln Asp Asp Ala Tyr Glu Tyr Met Val Glu Tyr Ser Gln Thr Asn 
115 120 125 

Lys Leu Ala Gly Arg Leu Glu Asn Val Val Gly Trp Tyr His Ser His 
130 135 140 

Pro Gly Tyr Gly Cys Trp Leu Ser Gly Ile Asp Val Ser Thr Gln Arg 
145 150 155 160 

Leu Asn Gln Gln His Gln Glu Pro Phe Leu Ala Val Val Ile Asp Pro 
165 170 175 

Thr Arg Thr Val Ser Ala Gly Lys Val Glu Ile Gly Ala Phe Arg Thr 
180 185 190 

Tyr Ser Lys Gly Tyr Lys Pro Pro Asp Glu Pro Val Ser Glu Tyr Gln 
195 200 205 

Thr Ile Pro Leu Asn Lys Ile Glu Asp Phe Gly Val His Cys Lys Gln 
210 215 220 

Tyr Tyr Ser Leu Asp Val Thr Tyr Phe Lys Ser Ser Leu Asp Ser His 
225 230 235 240 

Leu Leu Asp Leu Leu Trp Asn Lys Tyr Trp Val Asn Thr Leu Ser Ser 
245 250 255 

Ser Pro Leu Leu Gly Asn Gly Asp Tyr Val Ala Gly Gln Ile Ser Asp 
260 265 270 

Leu Ala Glu Lys Leu Glu Gln Ala Glu Ser His Leu Val Gln Ser Arg 
275 280 285 

Phe Gly Gly Val Val Pro Ser Ser Leu His Lys Lys Lys Glu Asp Glu 
290 295 300 

Ser Gln Leu Thr Lys Ile Thr Arg Asp Ser Ala Lys Ile Thr Val Glu 
305 310 315 320 

Gln Val His Gly Leu Met Ser Gln Val Ile Lys Asp Glu Leu Phe Asn 
325 330 335 

Ser Met Arg Gln Ser Asn Asn Lys Ser Pro Thr Asp Ser Ser Asp Pro 
340 345 350 

Asp Pro Met Ile Thr Tyr 
355 



<210> 110 
<211> 98 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 110 

Met Ala Ile Ser Lys Ala Leu Ile Ala Ser Leu Leu Ile Ser Leu Leu 
1 5 10 15 

Val Leu Gln Leu Val Gln Ala Asp Val Glu Asn Ser Gln Lys Lys Asn 
20 25 30 

Gly Tyr Ala Lys Lys Ile Asp Cys Gly Ser Ala Cys Val Ala Arg Cys 
35 40 45 

Arg Leu Ser Arg Arg Pro Arg Leu Cys His Arg Ala Cys Gly Thr Cys 
50 55 60 

Cys Tyr Arg Cys Asn Cys Val Pro Pro Gly Thr Tyr Gly Asn Tyr Asp 
65 70 75 80 

Lys Cys Gln Cys Tyr Ala Ser Leu Thr Thr His Gly Gly Arg Arg Lys 
85 90 95 

Cys Pro 



<210> 111 



<211> 385 
<212> PRT 



<213> Arabidopsis thaliana 
<220> 

<221> VARIANT 

<222> 252,253 

<223> Xaa = any amino acid 

<400> 111 

Met Gly Lys Lys Asn Lys Arg Ser Gln Asp Glu Ser Glu Leu Glu Leu 
1 5 10 15 

Glu Pro Glu Leu Thr Lys Ile Ile Asp Gly Asp Ser Lys Lys Lys Lys 
20 25 30 

Asn Lys Asn Lys Lys Lys Arg Ser His Glu Asp Thr Glu Ile Glu Pro 
35 40 45 

Glu Gln Lys Met Ser Leu Asp Gly Asp Ser Arg Glu Glu Lys Ile Lys 
50 55 60 

Lys Lys Arg Lys Asn Lys Asn Gln Glu Glu Glu Pro Glu Leu Val Thr 
65 70 75 80 

Glu Lys Thr Lys Val Gln Glu Glu Glu Lys Gly Asn Val Glu Glu Gly 
85 90 95 

Arg Ala Thr Val Ser Ile Ala Ile Ala Gly Ser Ile Ile His Asn Thr 
100 105 110 

Gln Ser Leu Glu Leu Ala Thr Arg Val Ile Ser Leu Ser Leu Tyr Leu 
115 120 125 

Ser Leu Arg Phe Ser Val Phe Pro Phe Pro Asp Asn Leu Lys Ser Pro 
130 135 140 

Ser Ser Ile Ser Asn Ile Ser Gln Leu Ala Gly Gln Ile Ala Arg Ala 
145 150 155 160 

Ala Thr Ile Phe Arg Ile Asp Glu Ile Val Val Phe Asp Asn Lys Ser 
165 170 175 

Ser Ser Glu Ile Glu Ser Ala Ala Thr Asn Ala Ser Asp Ser Asn Glu 
180 185 190 

Ser Gly Ala Ser Phe Leu Val Arg Ile Leu Lys Tyr Leu Glu Thr Pro 
195 200 205 

Gln Tyr Leu Arg Lys Ser Leu Phe Pro Lys Gln Asn Asp Leu Arg Tyr 
210 215 220 

Val Gly Met Leu Pro Gly Met Leu Pro Pro Leu Asp Ala Pro His His 
225 230 235 240 

Leu Arg Lys His Glu Trp Glu Gln Tyr Arg Glu Xaa Xaa Ile Val Pro 
245 250 255 

Pro Ser Lys Pro Arg Glu Glu Ala Gly Met Tyr Trp Gly Tyr Lys Val 
260 265 270 

Arg Tyr Ala Ser Gln Leu Ser Ser Val Phe Lys Glu Cys Pro Phe Glu 
275 280 285 

Gly Gly Tyr Asp Tyr Leu Ile Gly Thr Ser Glu His Gly Leu Val Ile 
290 295 300 

Ser Ser Ser Glu Leu Lys Ile Pro Thr Phe Arg His Leu Leu Ile Ala 
305 310 315 320 

Phe Gly Gly Leu Ala Gly Leu Glu Glu Ser Ile Glu Asp Asp Asn Gln 
325 330 335 

Tyr Lys Gly Lys Asn Val Arg Asp Val Phe Asn Val Tyr Leu Asn Thr 
340 345 350 

Cys Pro His Gln Gly Ser Arg Thr Ile Arg Ala Glu Glu Ala Met Phe 
355 360 365 

Ile Ser Leu Gln Tyr Phe Gln Glu Pro Ile Ser Arg Ala Val Arg Arg 
370 375 380 

Leu 
385 



<210> 112 

<211> 465 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 112 



Met 


Glu 


Leu 


Leu 


1 








Val 


Ala 


vai 


Leu 








20 


Ser 


Phe 


Leu 


Trp 






JO 




Ser 


Ala 


AJ.a 


Ser 




50 






Ser 


Asn 


Leu 


HIS 


65 








Ala 


lie 


Tyr 


Arg 


Ala 


Tyr 


Leu 


He 








100 


Lys 


Glu 


Gly 


Gly 






115 




Ly s 


Val 


lie 


Ser 




130 






Glu 


Gly 


Leu 


Arg 


145 








Ser 


Leu 


He 


Glu 


Ala 


Gly 


Pro 


Val 








180 


Lys 


Gly 


He 


Trp 






195 




Gly Ala 


Met 


He 




210 






Tyr 


Leu 


Tyr 


Leu 


225 








Val 


Tyr 


Gin 


Glu 


Ala 


Gly 


Phe 


Thr 








260 


Glu 


Ala 


Ser 


He 






275 




Glu 


Thr 


Gin 


Thr 




290 







Asp 


lxie t. 


Asn 


O J- 


o 








Arg 


irne 


Leu 


Leu 


Arg 


rile 


Tip 


XT -L \J 








40 


Gly 


Ala 


Phe 


Leu 






D D 




rfie 


Leu 


v ajL 


Pro 




"7 n 






Pro 


Leu 


C? n. tr- 

ber 


Cjxy 


85 








Gly 


Cys 


His 


Val 


He 


Asp 


Ser 


Thr 








120 


Cys 


Ser 


lie 


Asn 






135 




Glu 


Ala 


Gin 


Lys 




150 






Tyr 


Phe 


Gly 


Tyr 


165 








Phe 


Glu 


Met 


Lys 


Ala 


Val 


Ser 


Glu 








200 


Arg 


Ala 


Val 


Phe 






215 




Val 


Pro 


Gin 


Phe 




230 






Trp 


Gly 


Phe 


Leu 


245 








Ala 


Arg 


Trp 


Lys 


lie 


He 


Ser 


Gly 








280 


Lys 


Ala 


Lys 


Trp 






295 





Met Ala Ala Ser 
10 

Cys Phe Val Ala 
25 

Ser Arg Leu Gly 

Ser Tyr Leu Ser 
60 

Met Thr Ile Gly 
75 

Phe Ile Thr Phe 

90 

Phe Tyr Met Ser 
105 

Gly Ala Leu Met 

Tyr Asn Asp Gly 
140 

Lys Asn Arg Leu 
155 

Cys Leu Cys Cys 
170 

Asp Tyr Leu Glu 
185 

Lys Gly Lys Arg 

Gln Ala Ala Ile 
220 

Pro Leu Thr Arg 
Lys Arg Phe Gly 
250 

Tyr Tyr Phe Ile 
Leu Gly Phe Ser 

Asp Arg Ala Lys 
300 



Ile Gly Val Ser 
15 

Thr Ile Pro Ile 
30 

Lys His Ile Tyr 
45 

Phe Gly Phe Ser 

Tyr Ala Ser Met 
80 

Phe Leu Gly Phe 
95 

Gly Asp Ala Trp 
Val Leu Thr Leu 
Met Leu Lys Glu 

Ile Gln Met Pro 
160 

Gly Ser His Phe 
Trp Thr Glu Glu 
Pro Ser Pro Tyr 
Cys Met Ala Leu 

Phe Thr Glu Pro 
240 

Tyr Gln Tyr Met 
255 

Trp Ser Ile Ser 
Gly Trp Thr Asp 
Asn Val Asp Ile 






Leu 


Gly Val 


Glu 


Leu 


Ala 


Lys 


Ser 


Ala 


Val 


Gin 


He 


Pro 


Leu 


Pne 


Trp 


305 










310 










315 










320 


Asn 


lie 


Gin 


Val 


Ser 
325 


Thr 


Trp 


Leu 


Arg 


His 
330 


Tyr 


Val 


Tyr 


Glu 


Arg 

■"5 T> CT 


He 


Val 


Lys 


Pro 


Gly Lys Lys 


Ala 


Gly 


Phe 


Phe 


Gin 


Leu 


Leu 


Ala 


Thr 


Gin 








340 










345 










350 






Thr 


Val 


Ser Ala 


Val 


Trp 


His 


Gly 


Leu 


Tyr 


Pro 


Gly 


Tyr 


lie 


He 


Phe 






355 










360 










365 








Phe 


Val 
370 


Gin 


Ser 


Ala 


Leu 


Met 
375 


lie 


Asp 


Gly 


Ser 


Lys 
380 


Ala 


lie 


Tyr 


Arg 


Trp 


Gin 


Gin 


Ala 


He 


Pro 


Pro 


Lys 


Met 


Ala 


Met 


Leu 


Arg 


Asn 


Val 


Leu 


385 










390 










395 










400 


Val 


Leu 


lie 


Asn 


Phe 
4 05 


Leu 


Tyr 


Thr 


Val 


Val 
410 


Val 


Leu 


Asn 


Tyr 


Ser 
415 


Ser 


Val 


Gly 


Phe 


Met 
420 


Val 


Leu 


Ser 


Leu 


His 
425 


Glu 


Thr 


Leu 


Val 


Ala 
430 


Phe 


Lys 


Ser 


Val 


Tyr 
435 


Tyr 


He 


Gly 


Thr 


Val 
440 


He 


Pro 


He 


Ala 


Val 
445 


Leu 


Leu 


Leu 


Ser 


Tyr 
450 


Leu 


Val 


Pro 


Val 


Lys 
455 


Pro 


Val 


Arg 


Pro 


Lys 
4 60 


Thr 


Arg 


Lys 


Glu 



Glu 
4 65 
<212> PRT 

<213> Arabidopsis thaliana 
130 135 140 

Pro Asn Lys Val Met Ile Trp Asp Asp His Gln Gly Arg Cys Ile Gly 
145 150 155 160 

Glu Leu Ser Phe Arg Ser Asp Val Arg Ser Val Arg Leu Arg Arg Asp 
165 170 175 

Arg Ile Ile Val Val Leu Glu Gln Lys Ile Phe Val Tyr Asn Phe Ser 
180 185 190 

Asp Leu Lys Leu Met His Gln Ile Glu Thr Ile Ala Asn Pro Lys Gly 
195 200 205 

Leu Cys Ala Val Ser Gln Gly Val Gly Ser Met Val Leu Val Cys Pro 
210 215 220 

Gly Leu Gln Lys Gly Gln Val Arg Ile Glu His Tyr Ala Ser Lys Arg 
225 230 235 240 

Thr Lys Phe Val Met Ala His Asp Ser Arg Ile Ala Cys Phe Ala Leu 
245 250 255 

Thr Gln Asp Gly His Leu Leu Ala Thr Ala Ser Ser Lys Gly Thr Leu 
260 265 270 

Val Arg Ile Phe Asn Thr Val Asp Gly Thr Leu Arg Gln Glu Ser Gly 
275 280 285 

Thr Ser Glu Asp Glu Ile Gly Lys Glu Gly Ala Asp Arg Ala Glu Ile 
290 295 300 

Tyr Ser Leu Ala Phe Ser Ser Asn Ala Gln Trp Leu Ala Val Ser Ser 
305 310 315 320 

Asp Lys Gly Thr Val His Val Phe Gly Leu Lys Val Asn Ser Gly Ser 
325 330 335 

Gln Val Lys Asp Ser Ser Arg Ile Ala Pro Asp Ala Thr Pro Ser Ser 
340 345 350 

Pro Ser Ser Ser Leu Ser Leu Phe Lys Val Leu Pro Arg Tyr Phe Ser 
355 360 365 

Ser Glu Trp Ser Val Ala Gln Phe Arg Leu Val Glu Gly Thr Gln Tyr 
370 375 380 

Ile Ala Ala Phe Gly His Gln Lys Asn Thr Val Val Ile Leu Gly Met 
385 390 395 400 

Asp Gly Ser Phe Tyr Arg Cys Gln Phe Asp Pro Val Asn Gly Gly Glu 
405 410 415 

Met Ser Gln Leu Glu Tyr His Asn Cys Leu Lys Pro Pro Ser Val Phe 
420 425 430 
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350 






Glu 


Leu 


Thr 


Glu 


Arg 


Lys 


Lys 


Val 










365 








Thr 


Ala 


Leu 


Leu 


Gly 


Gin 


He 


Lys 








380 










Lys 


Lys 


Glu 


Ser 


Asp 


Met 


Met 


Met 






395 










400 


Leu 


Met 


Ala 


Ala 


Leu 


Lys 


Gly 


Lys 




410 










415 




Phe 


Ala 


He 


Met 


Tyr 


Leu 


He 


Ser 


425 










430 
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Thr 


Glu 


Thr 


He 






435 




Asn 


Glu 


Ala 


Glu 




450 






Lys 


Ser 


Leu 


Asn 


4 65 








Arg 


Ser 


Asn 


lie 


Ser 


Ala 


Val 


Thr 








500 


Leu 


Ala 


Val 


Thr 






515 




Pro 


Glu 


lie 


Asp 




530 






Ser 


Ser 


Ser 


Gly 


c a a 

545 








Val 


Phe 


Met 


He 


Glu 


Leu 


Thr 


Gin 








580 


Thr 


Glu 


He 


Leu 






595 




Gly 


Lys 


Lys 


Met 




610 






Leu 


Gly 


Met 


Ala 


625 








Leu 


Thr 


Arg 


Glu 


Arg 


Phe 


Gin 


Val 








660 


Tyr 


Leu 


Glu 


Phe 






675 




Leu 


Ser 


Gly 


Arg 




690 






Ser 


Lys 


Ala 


Arg 


705 








Cys 


Gly 


Val 


Glu 


Lys 


Asp 


Gly 


Gly 








740 


Glu 


Thr 


Arg 


Arg 






755 




Thr 


Asn 


Glu 


Met 




770 






Ser 


Leu 


Val 


Arg 


785 








Lys 


Glu 


Arg 


Ala 


Gin 


lie 


Glu 


Asn 








820 


Tyr 


Gly 


Thr 


Asn 






835 




Leu 


Val 


Glu 


Glu 




850 






Glu 


Glu 


Leu 


Cys 


865 








Val 


Ser 


Thr 


Phe 



Asn 


bin 


Ser 










440 


Ala 


Asp 


Thr 


Ser 






A R R 




-/4j_a 


-Ser 


irne 


/iia 




4 70 






vai 


Asp 


Trp 


rila 


A R ^ 








AJ_a 


Gly 


vai 


Lys 


Arg 


inr 


va jl 


oJLU 








520 


Ser 


Tyr 


Arg 


Phe 






Djj 




Gly 


Ser 


HIS 


val 




ccn 
OjU 






Gly 


Gly 


Gly 


Asn 


ODD 








Arg 


Gin 


Leu 


Thr 


Asn 


Gly 


Gly 


talU 








600 


Gly 


Leu 


Gly 


Gly 






615 




Gly 


Lys 


Glu 


Glu 




630 






Ala 


Thr 


Glu 


He 


645 








Asp 


Ser 


Leu 


Glu 


Gly 


Ser 


Glu 


Glu 








680 


Val 


Arg 


Ser 


Thr 






by o 




Val 


Leu 


Ala 


He 




710 






Gin 


He 


Glu 


Glu 


725 








Ser 


Ser 


Ser 


Ser 


Tyr 


Ser 


Gly 


Ser 








760 


Leu 


Gin 


Ser 


He 






775 




Arg 


Val 


Thr 


Val 




/ y u 






Leu 


Leu 


Gly 


Glu 


one 








Leu 


Ser 


Val 


Lys 


Ser 


Val 


Leu 


Asn 








840 


Thr 


Met 


Arg 


Gin 






855 




Arg 


Val 


Lys 


Arg 




870 






Thr 


Asn 


Val 


Arg 



vai 


blU 


rila 


\7 ra 1 

vai 


Aia 


jrne 


uin 


Tyr 








4 60 


Ala 


Thr 


Ser 


Ala 






A 7 R 




olU 


Lys 


Leu 


Tyr 




a Q-n 






Asn 


Leu 


Leu 


oer 


OUj 








AJLa 


jueu 


l nr 


G1U 


T Ai i 

Leu 


Asp 


Pro 


Arg 








540 


Lys 


Gly 


Pro 


Phe 






c c c 

555 




Tyr 


IT- I 

vai 


GlU 


Tyr 




o / u 






Val 


Lys 


Asn 


val 


C Q C 








Leu 


Val 


Glu 


Gin 


Pro 


Val 


Ala 


Ser 








620 


Thr 


Asp 


Val 


Ser 






635 




Trp 


Arg 


Ser 


Glu 










Ala 


Glu 


Leu 


Val 


a a ^ 








Asp 


Ala 


Arg 


Lys 


Ala 


Thr 


Met 


Leu 








700 


Pro 


Asp 


Asp 


Leu 






1 15 




Leu 


Lys 


Gly 


Leu 




730 






Asp 


Gly 


Ala 


Arg 


745 








Leu 


Gly 


Val 


Glu 


Glu 


Met 


Val 


Thr 








780 


Ala 


Glu 


Ser 


Glu 






/ y o 




Glu 


Glu 


He 


Ser 




810 






Leu 


Glu 


Glu 


Met 


825 








Glu 


Met 


Arg 


Glu 


Arg 


Glu 


Lys 


Ala 








860 


Glu 


Phe 


Glu 


Ser 






875 




Glu 


Thr 


Leu 


Leu 



bill 


Aid 


711 a 
/ila 


Leu 


A A ^ 
t ± H D 








vai 


Lys 


Lys 


lie 


.Asn 


Ser 


HJla 


s er 








480 


Gly 


Gin 


Ser 


He 










oer 


ASp 


Gin 


Gin 




DlU 






Gly 


Lys 


Pro 


Asn 










Ala 


Pro 


Lys 


Ser 


Arg 


GlU 


TV "I _ 

Ala 


lie 








560 


Gly 


Ser 


Leu 


Gin 






5/5 




He 


Tyr 


Gly 


Ala 




r- O M 

5yo 






Leu 


Gly 


Leu 


Leu 


bUo 








Thr 


Leu 


Lys 


Arg 


TV 1 

Ala 


Gin 


Gly 


Ser 








640 


Leu 


Glu 


Ser 


Arg 






/r c c 

655 




Asp 


Val 


Lys 


Ala 




670 






Glu 


Leu 


Gly 


TT_ 1 

Val 


685 








Arg 


Tyr 


Leu 


Arg 


Ala 


Asn 


Val 


Ser 








720 


Asn 


Leu 


Val 


Glu 






735 




Asn 


Thr 


Asn 


Pro 




/ oU 






Asp 


Gly 


Ala 


Tyr 


765 








Asp 


Val 


Leu 


Asp 


Ser 


Ala 


Val 


Gin 








800 


Arg 


Lys 


Thr 


He 






QIC 

o 15 




Glu 


Arg 


Phe 


Ala 




q "5 n 






Arg 


He 


Glu 


Glu 


845 








Val 


Glu 


Asn 


Glu 


Leu 


Lys 


Ser 


Tyr 








880 


Ser 


Ser 


Glu 


Arg 
885 890 895 

Gln Phe Lys Thr Ile Glu Glu Leu Phe Glu Arg Leu Val Thr Lys Thr 
900 905 910 

Thr Gln Leu Glu Gly Glu Lys Ala Gln Lys Glu Val Glu Val Gln Lys 
915 920 925 

Leu Met Glu Glu Asn Val Lys Leu Thr Ala Leu Leu Asp Lys Lys Glu 
930 935 940 

Ala Gln Leu Leu Ala Leu Asn Glu Gln Cys Lys Val Met Ala Leu Ser 
945 950 955 960 

Ala Ser Asn Ile 



<210> 124 

<211> 222 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 124 



Met 


Asp 


Ala 


Met 


Lys 


Glu 


Glu 


He 


Gin 


Arg 


Val 


Lys 


Glu 


Gin 


Glu 


Glu 


1 








5 










10 










15 




Gin 


Ala 


Met 


Arg 
20 


Glu 


Ala 


Leu 


Gly 


Leu 
25 


Ala 


Pro 


Lys 


Ser 


Ser 
30 


Thr 


Arg 


Pro 


Gin 


Gly 
35 


Asn 


Arg 


Leu 


Asp 


Lys 
40 


Gin 


Glu 


Phe 


Thr 


Glu 
45 


Leu 


Val 


Lys 


Arg 


Gly 


Ser 


Thr 


Ala 


Glu 


Asp 


Leu 


Gly Ala 


Gly 


Asn 


Ala 


Asp 


Ala 


Val 




50 










55 










60 










Trp 


Val 


His 


Gly 


Leu Gly 


Tyr 


Ala 


Lys 


Ala 


Pro 


Arg 


Pro 


Trp 


Glu 


Asp 


65 










70 










75 










80 


Pro 


Ser 


Thr 


Leu 


Ala 
85 


Ser 


Ser 


Gin 


Lys 


Glu 
90 


Asp 


Ala 


Asp 


Ser 


Ala 
95 


Arg 


Leu 


Pro 


Ala 


Asp 
100 


Thr 


Ser 


Gly 


Val 


Lys 
105 


Thr 


Val 


Glu 


Asp 


Gly 
110 


Pro 


Asp 


Asp 


Val 


Glu 


Arg Asp 


Gin 


Arg 


Arg 


lie 


Gly Val 


Arg 


Lys 


Gly Asn 


Leu 






115 










120 










125 








Gin 


Arg 
130 


Glu 


Arg 


Arg 


Lys 


Lys 
135 


Asp 


Met 


He 


Gly 


Val 
140 


Lys 


Asn 


Ala 


Lys 


Gly 


Met 


Arg 


Ser 


Glu 


Ala 


Leu 


Val 


He 


Gin 


Met 


He 


Glu 


Arg 


Ser 


Thr 


145 








150 










155 










160 


Arg 


Lys 


Arg 


Arg 


Arg Arg 


Lys 


Lys 


Glu Gly Met 


Thr 


Leu 


lie 


Leu 


He 










165 










170 










175 




Glu 


Ala 


Asn 


Cys 


Pro Arg 


Met 


Glu 


His 


Phe 


Ala 


Leu 


Gin 


Arg 


Lys 


Ser 








180 










185 










190 






Gly 


Arg 


Leu 


Gly 


Thr 


Lys 


He 


Gin 


Leu 


Pro 


Leu 


Leu 


Gin 


Asp 


Leu 


Asn 




195 










200 










205 








Leu 


Leu 
210 


Leu 


lie 


Ser 


Phe 


Thr 
215 


Asn 


Arg 


Gly 


Val 


Lys 
220 


Cys 


Cys 







<210> 125 

<211> 148 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 125 

Met Gly Lys Asp Gly Leu Ser Asp Asp Gln Val Ser Ser Met Lys Glu 
1 5 10 15 
Ala 


Phe 


Met 


Leu 
20 


Phe 


Asp 


Thr 


Asp 


Glu 


Leu Gly 


He 


Leu 


Met 


Arg 


Ser 






35 










40 


Gin 


Leu 
50 


Lys 


Sex 


He 


He 


Ala 
55 


Ser 


Phe 


Asn Arg 


Phe 


Leu 


Asp 


Leu 


Met 


65 










70 






Phe 


Asp 


Arg 


Gin 


Leu 


Arg 


Asp Ala 










85 








Thr 


Gly 


Phe 


Val 
100 


Ala 


Val 


Ala 


Asp 


Gly 


Glu 


Lys 
115 


Leu 


Glu 


Pro 


Asn 


Glu 
120 


Asp 


Val 
130 


Gly 


Ser 


Asp 


Gly 


Lys 
135 


He 


Met 


Val 


Ala 


Lys 










145 

















<210> 126 

<211> 70 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 126 












Met Glu Lys 


Gin 


Ser 


Thr 


Gin 


Pro 


1 




5 








Leu Leu Asn 


Cys 


Val 


Ala 


Glu 


Ser 




20 










Arg Phe Leu 


Gin 


Ser 


Leu 


Arg 


Glu 


35 










40 


Lys Phe Ser 


He 


Pro 


Ser 


Gin 


Asp 


50 








55 




Ala Thr Lys 


Arg 


Pro 


Ser 






65 






70 







<210> 127 

<211> 385 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 127 



Met Thr 


Thr 


Thr 


Gly 


Ser 


Asn 


Ser 


1 






5 








Asn Asn 


Asn 


Asn 


Asn 


Pro 


Ser 


Thr 






20 










Gly Gin 


Ser 


Val 


Ser 


Thr 


Ser 


Gly 




35 










40 


Ser Glu 


Gin 


Thr 


He 


Thr 


Val 


Val 


50 










55 




Gin Arg 


Leu 


Asn 


Asn 


Leu 


Asp 


He 


65 








70 






Gly Ala 


Ser 


Gly 


Val 


Lys 


Lys 


Lys 








85 








Pro Asp 


Lys 


Thr 


Gly 


Arg 


Gly Leu 



Gly Asp 


Gly Lys 


He 


Ala 


Pro 


Ser 


25 








30 






Leu Gly Gly Asn 


Pro 


Thr 


Gin 


Ala 








45 








Glu 


Asn 


Leu Ser 


Ser 


Pro 


Phe 


Asp 






60 










Ala 


Lys 


His Leu 


Lys 


Thr 


Glu 


Pro 






75 








80 


Phe 


Lys 


Val Leu 


Asp 


Lys 


Glu 


Gly 




90 








95 




Leu 


Arg 


His He 


Leu 


Thr 


Ser 


He 


105 








110 






Phe 


Asp 


Glu Trp 


He 


Lys 


Glu 


Val 








125 








Arg 


Tyr 


Glu Asp 


Phe 


He 


Ala 


Arg 






140 










He 


Cys 


Gly Gin 


Glu 


Ala 


Leu 


Gin 




10 








15 




Pro 


Phe 


Asp Gin 


Glu 


Lys 


Cys 


Val 


25 








30 






Cys 


Val 


Leu Ser 


Lys 


Lys 


Val 


Lys 








45 








His 


Asp 


Ser Glu 


Gly 


Ala 


Ala 


Ser 






60 










Asn 


His 


Asn His 


His 


Glu 


Ser 


Asn 




10 








15 




Arg 


Ser 


Trp Gly 


Thr 


Ala 


Val 


Ser 


25 








30 






Ser 


Met 


Gly Ser 


Pro 


Ser 


Ser Arg 








45 








Thr 


Ser 


Thr Ser 


Asp 


Thr 


Thr 


Phe 






60 










Gin 


Gly Asp Asp 


Ala 


Gly 


Ser 


Gln






75 








80 


Lys 


Arg 


Gly Gln


Arg 


Ala 


Ala 


Gly 




90 








95 




Arg 


Gln


Phe Ser 


Met 


Lys 


Val 


Cys 
100

Glu Lys Val Glu 
115 

Glu Leu Val Ala 
130 

Asp Gln Gln Gln
145 

Ala Leu Asn Val 

Glu Ile Gln Trp
180 

Glu Leu Lys Asn 
195 

Thr Ala Tyr Ser 
210 

Leu Ile Gln Arg
225 

Gly Gly Val Ala 

Thr Val Glu Val 
260 

Phe Asn Ser Thr 
275 

Thr Met Lys Phe 
290 

Asn Ser Gln Leu
305 

Gly Pro Ser Thr 

Leu Gln Ser Gln
340 

Met Pro Glu Thr 
355 

Lys Ser Pro Ser 
370 

Asn 
385 



Ser Lys Gly Arg 
120 

Glu Phe Ala Leu 
135 

Tyr Asp Glu Lys 
150 

Leu Met Ala Met 
165 

Arg Gly Leu Pro 

Glu Arg Leu Ser 
200 

Gln Glu Leu Glu
215 

Asn Glu His Leu 
230 

Leu Pro Phe Ile
245 

Glu Ile Ser Glu

Pro Phe Glu Leu 
280 

Cys Asp Gln Pro
295 

Val Cys His Asn 
310 

Gly Pro Thr Pro 
325 

Gln His Gln Gln

Asn Asn Val Thr 
360 

Leu Pro Gly Ile
375 



105 

Thr Thr Tyr Asn 

Pro Asn Asn Asp 
140 

Asn Ile Arg Arg
155 

Asp Ile Ile Ser
170 

Arg Thr Ser Leu 
185 

Leu Arg Asn Arg 

Glu Gln Tyr Val
220 

Tyr Ser Ser Gly 
235 

Leu Val Gln Thr
250 

Asp Met Gln Leu
265 

His Asp Asp Asn 

Pro Gln Gln Pro
300 

Phe Thr Pro Glu 
315 

Gln Leu Asp Met
330 

His Ser Gln Leu
345 

Ser Ser Ala Asp 

Met Asn Ser Ser 
380 



110 

Glu Val Ala Asp 
125 

Gly Thr Ser Pro 

Arg Val Tyr Asp 
160 

Lys Asp Lys Lys 
175 

Ser Asp Ile Glu
190 

Ile Glu Lys Lys
205 

Gly Leu Gln Asn

Asn Ala Pro Ser 
240 

Arg Pro His Ala 
255 

Val His Phe Asp 
270 

Phe Val Leu Lys 
285 

Asn Gly Arg Asn 

Asn Pro Asn Lys 
320 

Tyr Glu Thr His 
335 

Gln Ile Ile Pro
350 

Thr Ala Pro Val 
365 

Met Lys Pro Glu 



<210> 128 

<211> 437 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 128 



Met 


Ala 


Leu 


Gin 


Asn 


He 


Gly 


Ala 


1 








5 








Arg 


Tyr 


Lys 


Met 
20 


Pro 


Lys 


Met 


Val 


Gly 


He 


Lys 


Thr 


Asn 


He 


He 


Asn 




35 










40 


Ala 


Arg 
50 


Pro 


Pro 


Ser 


Tyr 


Thr 
55 


Thr 


Ala 


Gin 


Ser 


Lys 


Phe 


Asp 


Glu 


Lys 


65 










70 






Ala 


His 


Asn 


Thr 


Ser 


Lys 


Leu 


Ala 



85 



Ser Asn Arg Asn Asp Ala Phe Tyr 

10 15 

Thr Lys Thr Glu Gly Lys Gly Asn 
25 30

Asn Val Glu Ile Ala Lys Ala Leu
45 

Lys Tyr Phe Gly Cys Glu Leu Gly 
60 

Thr Gly Thr Ser Leu Val Asn Gly 

75 80 

Gly Leu Leu Glu Asn Phe Ile Lys

90 95 






Lys Phe 


Val 


Gin 


Cys 


Tyr 


Gly Cys 


Gly Asn 


Pro 


Glu 


Thr 


Glu 


He 


He 






100










105 










110 






Ile Thr Lys Thr


Gin 


Met 


Val 


Asn Leu


Lys 


Cys 


Ala


Ala 


Cys 


Gly 


Phe 




115 










120 










125 








Ile Ser Glu Val Asp


Met 


Arg Asp Lys


Leu 


Thr 


Asn 


Phe 


Ile


Leu 


Lys 


130 










135 










140 










Asn Pro 


Pro 


Glu 


Gin 


Lys 


Lys 


Val 


Ser 


Lys 


Asp 


Lys 


Lys


Ala 


Met 


Arg 


145 








150 










155 










160 


Lys Ala 


Glu Lys 


Glu 


Arg 


Leu 


Lys 


Glu Gly Glu 


Leu 


Ala 


Asp 


Glu 


Glu 








165 










170 










175 




Gin Arg 


Lys 


Leu 


Lys 


Ala 


Lys 


Lys 


Lys Ala 


Leu 


Ser 


Asn 


Gly 


Lys 


Asp 






180 










185 










190 






Ser Lys 


Thr 


Ser Lys 


Asn 


His 


Ser 


Ser Asp 


Glu 


Asp 


Ile


Ser 


Pro 


Lys 




195 










200 










205 








His Asp 


Glu 


Asn 


Ala 


Leu 


Glu 


Val 


Asp 


Glu Asp 


Glu 


Asp Asp Asp


Asp 


210 










215 










220 










Gly Val 


Glu 


Trp 


Gin 


Thr 


Asp 


Thr 


Ser 


Arg 


Glu 


Ala 


Ala 


Glu 


Lys 


Arg 


225 








230 










235 










240


Met Met 


Glu 


Gin 


Leu 
245 


Ser 


Ala 


Lys 


Thr 


Ala 
250 


Glu 


Met 


Val 


Met 


Leu 

255 


Ser 


Ala Met 


Glu 


Val 
260 


Glu 


Glu 


Lys 


Lys 


Ala 
265 


Pro 


Lys 


Ser 


Lys 


Ser 
270 


Asn 


Gly 


Asn Val 


Val 
275 


Lys 


Thr 


Glu 


Asn 


Pro 
280 


Pro 


Pro 


Gin 


Glu 


Lys 
285 


Asn 


Leu 


Val 


Gin Asp Met 


Lys 


Glu 


Tyr 


Leu 


Lys 


Lys 


Gly 


Ser 


Pro 


He 


Ser 


Ala 


Leu 


290 










295 










300 










Lys Ser 


Phe 


He 


Ser 


Ser 


Leu 


Ser 


Glu 


Pro 


Pro 


Gin 


Asp 


He 


Met 


Asp 


305 








310 










315 










320 


Ala Leu 


Phe 


Asn 


Ala 


Leu 


Phe Asp 


Gly Val 


Gly 


Lys 


Gly 


Phe 


Ala 


Lys 








325 










330 










335 




Glu Val 


Thr 


Lys 
340 


Lys 


Lys 


Asn 


Tyr 


Leu 
345 


Ala 


Ala 


Ala 


Ala 


Thr 
350 


Met 


Gin 


Glu Asp 


Gly 
355 


Ser 


Gin 


Met 


His 


Leu 
360 


Leu 


Asn 


Ser 


He 


Gly 
365 


Thr 


Phe 


Cys 


Gly Lys


Asn 


Gly Asn 


Glu 


Glu 


Ala 


Leu 


Lys 


Glu 


Val 


Ala 


Leu 


Val 


Leu 


370 










375 










380 










Lys Ala 


Leu 


Tyr Asp 


Gin 


Asp 


He 


He 


Glu 


Glu 


Glu 


Val 


Val 


Leu 


Asp 


385 








390 










395 










400 


Trp Tyr 


Glu 


Lys 


Gly 


Leu 


Thr 


Gly Ala Asp Lys 


Ser 


Ser 


Pro 


Val 


Trp 








405 










410 










415 




Lys Asn 


Val 


Lys 


Pro 


Phe 


Val 


Glu 


Trp 


Leu 


Gin 


Ser 


Ala 


Glu 


Ser 


Glu 



420 425 430 



Ser Glu Glu Glu Asp 
435 



<210> 129 
<211> 749 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 129 

Met Ala Ala Asn Lys Phe Ala Thr 
1 5 
Ile Thr Leu Ile Leu Val Tyr Ala
20

Phe Ile Leu Leu Asn Ser Leu Phe



Leu Ile His Arg Lys Thr Asn Arg

10 15
Phe Leu Glu Trp Ser Leu Ile Phe
25 30
Ser Tyr Phe Ile Leu Arg Phe Ala
35










40 


Asp 


Tyr 


Phe 


Gly 


Leu 


Lys Arg 


Pro 




50 










55 




Arg 


Phe 


Phe 


Asp 


Ala 


Ser 


Gly Lys 


65 










70 






Cys 


Asp 


Asp 


His 


Ala 


Leu 


Gin 


Leu 










85 








Asn 


Cys 


Gly 


Phe 


Gly 


Glu 


Phe 


His 








100 










Cys 


Val 


Glu 


Lys 


He 


Ser 


Ser 


Ser 






115 










120 


Phe 


Gly 


Asn 


Leu 


Asp 


Tyr 


Pro 


He 




130 










135 




Gly 


Leu 


Lys 


Phe 


Pro 


Arg 


Ser 


He 


145 










150 






Gly 


Ser 


Val 


Asn 


Leu 


Asn Asp 


Ser 










165 








Val 


Pro 


Gin 


Ser 


His 


Glu 


Lys 


Leu 








180 










Phe 


Ser 


Cys 


Tyr 


Val 


Ser 


Ser 


Phe 






195 










200 


Thr 


Glu 


Lys 


Glu 


Glu 


Glu Asn Arg 




210 










215 




Thr 


Ala 


Glu 


Ser 


Ala 


Pro 


Lys 


Asn 


225 










230 






Asp 


Cys 


His 


Leu 


He 


Pro 
Val


Pro 


Arg 
455 


Arg 


Gly 


Lys 


Thr 


Pro 
460 


Leu 


Asp 


Pro 


Asp 


Ala 


Ser 


Gly 


Phe 


Ile


Tyr 


Ser 


Asn 


Pro 


He 


Glu 


Thr 


Ser 


Phe 


Lys 


Gin 


465








470 










475 










480 


Arg 


Cys 


Leu 


Glu 


Asp 


Trp 


Lys 


Val 


His 


His 


Arg 


Lys 


Leu 


Leu 


Arg 


Thr 






485 










490 










495




Leu 


Gin 


Ser 


Glu 


Gly 


Leu 


Pro 


Val 


Leu 


Gly 


Asp 


Ala 


Ser 


Glu 


Ser 


Asp 








500 








505 










510 






Tyr 


Met 


Arg 


Val 


Val 


Glu 


Arg 


Leu 


Arg 


Asn 


He 


He 


Lys 


Gly 


Pro 


Ala 




515 










520 










525 








Leu 


Asn 
530 


Leu 


Leu 


Lys 


Pro 


Lys 
535 


Ala 


Ala 


Ser 


Lys 


Met 
540 


Val 


Val 


Ser 


Glu 


Leu 


Lys 


Glu 


Glu 


Leu 


Glu 


Ala 


Gin 


Gly 


Leu 


Pro 


He 


Asp 


Gly 


Thr 


Arg 


545 








550 










555 










560 


Asn 


Val 


Leu 


Tyr 


Gin 


Arg 


Val 


Gin 


Lys 


Ala 


Arg 


Arg 


He 


Asn 


Lys 


Ser 








565 










570 










575 




Arg 


Gly 


Arg 


Pro 
580 


Leu 


Trp 


Val 


Pro 


Pro 
585 


He 


Glu 


Glu 


Glu 


Glu 
590 


Glu 


Glu 


Val 


Asp 


Glu 


Glu 


Val 


Asp 


Asp 


Leu 


He 


Cys 


Arg 


He 


Lys 


Leu 


His 


Glu 




595 










600 










605 








Gly 


Asp 


Thr 


Glu 


Phe 


Trp 


Lys 


Arg 


Arg 


Phe 


Leu 


Gly 


Glu 


Gly 


Leu 


He 


610 










615 










620 










Glu 


Thr 


Ser 


Val 


Glu 


Ser 


Lys 


Glu 


Thr 


Thr 


Glu 


Ser 


Val 


Val 


Thr 


Gly 


625 










630 








635 










640 


Glu 


Ser 


Glu 


Lys 


Ala 


He 


Glu 


Asp 


He 


Ser 


Lys 


Glu 


Ala 


Asp 


Asn 


Glu 








645 










650 










655 




Glu 


Asp 


Asp 


Asp 


Glu 


Glu 


Glu 


Gin 


Glu 


Gly 


Asp 


Glu 


Asp 


Asp 


Asp 


Glu 




660 










665 










670






Asn 


Glu 


Glu 


Glu 


Glu 


Val 


Val 


Val 


Pro 


Glu 


Thr 


Glu 


Asn 


Arg 


Ala 


Glu 



675 680 685 
Gly


Glu 


Asp 


Leu


Val


Lys 


Asn 


Lys 


















Gin 


Met 


Ile


Gly 


Val 


bin 


Leu


Leu 


/UO 










*7 1 n 
/ iu 






Thr 


Lys 


Lys 


Arg 


Gly 


Lys 


Arg 


Ala 










"TO R 
/ZD 








TV T _ 

Ala 


Asp 


Glu 


Asp 


Trp 


Phe


Pro 


Glu 








n /i n 
/4U 










Met 


Arg 


Glu 


Arg 


Lys 


Val 


Phe


Asp 






75 5 










1 bU 


Asp 


Val 


Trp 


Gly 


Trp 


Thr 


Trp 


Glu 




770 










/ 75 




Arg 


Lys 


Trp 


Ser 


Gin 


Glu 


Trp 


Glu 


TOG 

785 










f yu 






Thr 


Lys 


Val 


He 


Glu 


Leu 


Gly 


Gly 










805 








Val 


lie 


Leu 


Arg 


Ala 


Ala 


Leu 


Arg 








o o r\ 

820 










Lys 


lie 


Leu 


Gin 


Thr 


Thr 


His 


Ser 
















a 4 n 

O fl u 


Leu 


Tyr 


Asp 


Glu 


He 


He 


Thr 


Leu 




850 










855 




Ala 


Ala 


He 


Ala 


He 


Val 


Ala 


Asp 


865 










870 






Pro 


Asp 


Gin 


Thr 


Leu 


Asp 


Lys 


Val 










885 








Ser 


Pro 


Arg 


Ser 


Glu 


Pro 


Glu 


Glu 



900 



<210> 132 

<211> 357 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 132 



Met 


Glu 


Gly 


Ser 


Ser 


Ser 


Ala 


He 


1 








5 








Asn 


Asn 


He 


Leu 


Pro 


Val 


Glu 


Pro 








20 










Phe 


His 


Tyr 


Asp 


Asp 


Ala 


Ser 


Gin 






35 










40 


Trp 


Ala 


Ser 


Asp 


Pro 


Asn 


Tyr 


Phe 




50 










55 




Ala 


Leu 


Leu 


Lys 


Met 


Val 


Val 


His 


65 










70 






He 


Met 


Gly 


Leu 


Met 


Gin 


Gly 


Lys 










85 








Met 


Asp 


Ala 


Phe 


Ala 


Leu 


Pro 


Val 








100 










Ala 


Gin 


Ser 


Asp 


Ala 


Tyr 


Glu 


Tyr 






115 










120 


Lys 


Leu 


Ala 


Gly 


Arg 


Leu 


Glu 


Asn 




130 










135 




Pro 


Gly 


Tyr 


Gly 


Cys 


Trp 


Leu 


Ser 


145 










150 






Leu 


Asn 


Gin 


Gin 


Tyr 


Gin 


Glu 


Pro 



Ala




Asp 


Ala 


Lys 


Lys 


His 


J_>fc: U 








700 










Lys 




Ser


Asp 


Glu 


Ala


Asn 


Arg 






/lj 










720 


Ser 


Arg 


Tvyr _ 0- 

Jxie L 


Thr 


Leu 


CI n 
blU 


Asp 


Asp 




/ jU 










701; 




CalU 


Pro 


irne 


Glu Ala 


trie 


Lys 


UlU 


*7 A C 










/ JV 






Val


Ala 


Asp 


Met 


Tyr 


inr 


±ie 


nla 










*7 i^" C 

765 








Lys 


Asp 


irne 


Lys 


Asn 


Lys 


1 nr 


Pro 








780 










Val 


CalU 


Leu


Ala 


He 


Val


Leu 


i v je l 
















ft nn 


lie 


Pro 


Thr 


He 


Gly 


Asp 


Cys 


Ala




810 










01 c 
oIj 




Ala 


Pro 


Met 


Pro 


Ser 


Ala 


Phe


Leu 


o o c 

825 










D J5U 






Leu 


Gly 


Tyr 


Ser 


Phe 


Gly 


ber 


Pro 










845 








Cys 


Leu 


Asp 


Leu 


Gly 


Glu 


Leu 


Asp 








860 










Met 


Glu 


Thr 


Thr 


Gly 


He 


Thr 


Val 






875 










880 


He 


Ser 


Ala 


Arg 


Gin 


Ser 


Asn 


Glu 




890 










895 




Pro 


Ala 


Ser 


Thr 


Val 


Ser 


Ser 




905 










910 







Ala 


Arg 


Lys 


Thr 


Trp 


Glu 


Leu 


Glu 




10 










15 




Thr 


Asp 


Ser 


Ala 


Ser 


Asp 


Ser 


He 


25 










30 






Ala 


Lys 


He 


Gin 


Gin 


Glu 


Lys 


Pro 










45 








Lys 


Arg 


Val 


His 


He 


Ser 


Ala 


Leu 








60 










Ala 


Arg 


Ser 


Gly 


Gly 


Thr 


He 


Glu 






75 










80 


Thr 


Glu 


Gly 


Asp 


Thr 


He 


He 


Val 




90 










95 




Glu 


Gly 


Thr 


Glu 


Thr


Arg 


Val 


Asn 


105 








110 






Met 


Val 


Glu 


Tyr 


Ser 


Gin 


Thr 


Ser 










125 








Val 


Val 


Gly 


Trp 


Tyr 


His 


Ser 


His 








140 










Gly 


He 


Asp 


Val 


Ser 


Thr 


Gin 


Met 






155 










160 


Phe 


Leu 


Ala 


Val 


Val 


He 


Asp 


Pro 
Thr


Arg 


Thr 


Val 








180 


Tyr 


Pro 


Glu 


Gly 






195 




Thr 


He 


Pro 


Leu 




210 






Tyr 


Tyr 


Ser 


Leu 


225 








Leu 


Leu 


Asp 


Leu 


OCX 


Pro 


Leu 


Leu 








260 


Leu 


Ala 


Glu 


Lys 






275 




Tyr 


Gly 


Gly 


He 




290 






Pro 


Gin 


Leu 


Ala 


305 








Gin 


Val 


His 


Gly 


Ser 


Ala 


Arg 


Gin 








340 


Pro 


Met 


He 


Thr 






355 





165 



Ser 


Ala 


Gly 


Lys 


His 


Lys


Ile


Ser 








200 


Asn 


Lys 


He 


Glu 






215 




Asp 


He 


Thr 


Tyr 




230 






Leu 


Trp 


Asn 


Lys 


245 








Gly 


Asn 


Gly 


Asp 


Leu 


Glu 


Gin 


Ala 








280 


Ala 


Pro 


Ala 


Gly 






295 




Lys 


He 


Thr 


Arg 




310 






Leu 


Met 


Ser 


Gin 


325 








Ser 


Lys 


Lys 


Ser 



Ser 



170 



Val 


Glu 


He 


Gly 


185 








Asp 


Asp 


His 


Val 


Asp 


Phe 


Gly 


Val 
220 


Phe 


Lys 


Ser 
235 


Ser 


Tyr 


Trp 
250 


Val 


Asn 


Tyr 


Val 


Ala 


Gly 


265 








Glu 


Ser 


Gin 


Leu 


His 


Gin 


Arg 


Arg 
300 


Asp 


Ser 


Ala 
315 


Lys 


Val 


He 
330 


Lys 


Asp 


Ala 


Asp 


Asp 


Ser 


345 













175 




Ala 


Phe 


Arg 


Thr 




190 






Ser 


Glu 


Tyr 


Gin 


205 








His 


Cys 


Lys 


Gin 


Leu 


Asp 


Ser 


His 








240 


Thr 


Leu 


Ser 


Ser 






255 




Gin 


He 


Ser 


Asp 




270 






Ala 


Asn 


Ser 


Arg 


285 








Lys 


Glu 


Asp 


Glu 


He 


Thr 


Val 


Glu 








320 


He 


Leu 


Phe 


Asn 






335 




Ser 


Asp 


Pro 


Glu 




350 







<210> 133 

<211> 57 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 133 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggtt agatcagatg aaaatag 



<210> 134 

<211> 54 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 134 

ggggaccact ttgtacaaga aagctgggtt cttattaata ttaaatcaga aacc 



<210> 135 

<211> 52 

<212> DNA 

<213> artificial sequence 



<220> 
<223> Description of Artificial Sequence: primer
<400> 135

ggggacaagt ttgtacaaaa aagcaggctt cacaatggta aatccgggtc ac 52 



<210> 136 

<211> 51 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 136 

ggggaccact ttgtacaaga aagctgggtt ttctgtagtc agacctggat a 51 



<210> 137 

<211> 55 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 137 

ggggacaagt ttgtacaaaa aagcaggctt cacaatgggg aaggaaaatg ctgtg 



55 



<210> 138 

<211> 54 

<212> DNA 

<213> artificial sequence 



<220> 



<223> Description of Artificial Sequence: primer

<400> 138

ggggaccact ttgtacaaga aagctgggtc cttcagaata gcgtgtcaag

54

<210> 139

<211> 54

<212> DNA

<213> artificial sequence

<220>

<223> Description of Artificial Sequence: primer

<400> 139

ggggacaagt ttgtacaaaa aagcaggctt cacaatgggg aagaagtgtg

54

<210> 140

<211> 50

<212> DNA

<213> artificial sequence
<220> 

<223> Description of Artificial Sequence: primer 
<400> 140 

ggggaccact ttgtacaaga aagctgggtt gtgagttaaa caacaaccgt 50 



<210> 141 

<211> 55 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 141 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggtt aactcatgcg agaac 



<210> 142

<211> 52

<212> DNA

<213> artificial sequence

<220>

<223> Description of Artificial

<400> 142



ggggaccact ttgtacaaga aagctgggtt ggattaagaa tgatgagact ca 



52 



<210> 143 

<211> 52 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 143 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggcg aataatcctc cg 52



<210> 144 

<211> 51 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 



<400> 144 

ggggaccact ttgtacaaga aagctgggtc actatcactc cccaacttct c 



51 
<210> 145

<211> 51 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 145

ggggacaagt ttgtacaaaa aagcaggctt cacaatggag ggttcgtcgt c 



51 



<210> 146 

<211> 49 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 146 

ggggaccact ttgtacaaga aagctgggtc caaaagaaga gcaacttca 



49 



<210> 147

<211> 56 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 147 

ggggacaagt ttgtacaaaa aagcaggctt cacaatgtat tgctcttctt cgatgc 



56 



<210> 148 

<211> 52 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 148 

ggggaccact ttgtacaaga aagctgggtg cttggtgtca tcttgagaat ag 



52 



<210> 149 

<211> 54 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 



<400> 149 
ggggacaagt ttgtacaaaa aagcaggctt cacaatggca aagatgcaat tatc 54

<210> 150 

<211> 50 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 150 

ggggaccact ttgtacaaga aagctgggta accatctgat cacaagaaca 50 

<210> 151 

<211> 58 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 151 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggct atttcaaaag ctcttatc 58 

<210> 152 

<211> 48 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 152 

ggggaccact ttgtacaaga aagctgggtg aggctagcgt agcactgg 48

<210> 153 

<211> 54 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 153 

ggggacaagt ttgtacaaaa aagcaggctt cacaatgggg aagaagaaca agag 54 

<210> 154 

<211> 51 

<212> DNA 

<213> artificial sequence 



<220> 
<223> Description of Artificial Sequence: primer

<400> 154 

ggggaccact ttgtacaaga aagctgggtg cttctttgac tctttttatc g 51 

<210> 155 

<211> 55 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 155 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggaa ttgcttgaca tgaac 55 

<210> 156 

<211> 53 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 156 

ggggaccact ttgtacaaga aagctgggtc aacattattc ttcttttctg gtc 53 

<210> 157 

<211> 54 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 157 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggac gagggagtta tagc 54 

<210> 158 

<211> 50 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 158 

ggggaccact ttgtacaaga aagctgggtc cttagagaga ggacttttct 50 



<210> 159 
<211> 55 
<212> DNA 
<213> artificial sequence
<220> 

<223> Description of Artificial Sequence: primer 

<400> 159
ggggacaagt ttgtacaaaa aagcaggctt cacaatggag ttgtttgtca ctcca 55

<210> 160 

<211> 49 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 160 

ggggaccact ttgtacaaga aagctgggtt cagcgagtat caatggatc 49

<210> 161 

<211> 52 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 161 

ggggacaagt ttgtacaaaa aagcaggctt cacaatgcaa ccgacagaga cg

<210> 162 

<211> 49 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 162 

ggggaccact ttgtacaaga aagctgggtg ctcgtccaac actaaggtt 

<210> 163 

<211> 55 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 163 

ggggacaagt ttgtacaaaa aagcaggctt cacaatgaat agggaaaagt tgatg 






<210> 164 

<211> 47 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer
<400> 164

ggggaccact ttgtacaaga aagctgggtc ctctaagaag cagcagc 47 



<210> 165 

<211> 52 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 165 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggag gacgacgacg ag 52 



<210> 166 

<211> 50 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 166 

ggggaccact ttgtacaaga aagctgggtt gtcagctact tacattgccg 50 



<210> 167 

<211> 51 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 167 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggcc accgtatctt c 51 



<210> 168 

<211> 48 

<212> DNA 

<213> artificial sequence 



<220>

<223> Description of Artificial Sequence: primer 
<400> 168

ggggaccact ttgtacaaga aagctgggtg attagaaaac tgaaggcg 48

<210> 169

<211> 56

<212> DNA

<213> artificial sequence
<220>

<223> Description of Artificial Sequence: primer

<400> 169

ggggacaagt ttgtacaaaa aagcaggctt cacaatggat gtaggagtta ctacgg 56



<210> 170

<211> 47

<212> DNA

<213> artificial sequence

<220>

<223> Description of Artificial

<400> 170



ggggaccact ttgtacaaga aagctgggtc taagccgagg cattgat 47 



<210> 171 

<211> 56 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 171 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggat ggtcatgatt ctaagg 



<210> 172 

<211> 49 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 172 

ggggaccact ttgtacaaga aagctgggtt taagaggaac tagccggtg 



<210> 173 

<211> 52 

<212> DNA 

<213> artificial sequence 



<220> 
<223> Description of Artificial Sequence: primer
<400> 173 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggaa atctacacca tg 52 



<210> 174 

<211> 51 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 174

ggggaccact ttgtacaaga aagctgggta actaaagaga actaagaaac t 51 



<210> 175 

<211> 53 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 175 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggag tttggatctt ttc 53 



<210> 176 

<211> 47 

<212> DNA 

<213> artificial sequence



<220> 
<223> Description of Artificial Sequence: primer



<400> 176 

ggggaccact ttgtacaaga aagctgggtc tctcaagctt taaacgc 



47 



<210> 177

<211> 54

<212> DNA

<213> artificial sequence

<220>

<223> Description of Artificial

<400> 177



ggggacaagt ttgtacaaaa aagcaggctt cacaatggcg gagcagaaga gtac 



54 



<210> 178 
<211> 54 
<212> DNA 
<213> artificial sequence
<220> 

<223> Description of Artificial Sequence: primer 

<400> 178 

ggggaccact ttgtacaaga aagctgggtc ctatcatgtc gattttgtat cttt 54 

<210> 179 

<211> 51 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 179 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggcg aatccttggt g 51 

<210> 180 

<211> 48 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer

<400> 180

ggggaccact ttgtacaaga aagctgggtt caatacgaag gaggagca 48

<210> 181 

<211> 53 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 181 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggct ctcaatctcc gtc 53 



<210> 182 

<211> 54 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 182 

ggggaccact ttgtacaaga aagctgggtg gattagagag tcatatgttt gatg 



54 
<210> 183

<211> 52 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 183 

ggggacaagt ttgtacaaaa aagcaggctt cacaatgctg atgctgtgtg gg 52 

<210> 184 

<211> 52 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 184 

ggggaccact ttgtacaaga aagctgggtt ttcaacaatg ttcaacaaca ct 52 

<210> 185 

<211> 54 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 185 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggtg aagttgatga tacg 54 

<210> 186 

<211> 49 

<212> DNA 

<213> artificial sequence 

<220> 

<223> Description of Artificial Sequence: primer 

<400> 186

ggggaccact ttgtacaaga aagctgggtt ttagtgcaac caaagagtc 49

<210> 187 

<211> 56 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 187 
ggggacaagt ttgtacaaaa aagcaggctt cacaatggag aaacagagta ctcaac 56

<210> 188 

<211> 51 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 188 

ggggaccact ttgtacaaga aagctgggtt tatgaaggtc tctttgtagc t 51 

<210> 189 

<211> 58 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 189 

ggggacaagt ttgtacaaaa aagcaggctt cacaatgaca actactgggt ctaattct 58 

<210> 190 

<211> 47 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 190 

ggggaccact ttgtacaaga aagctgggtt caattctccg gcttcat 47 

<210> 191 

<211> 52 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 191 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggcg ctgcagaaca tt 52 

<210> 192 

<211> 51 

<212> DNA 

<213> artificial sequence 



<220> 
<223> Description of Artificial Sequence: primer
<400> 192

ggggaccact ttgtacaaga aagctgggtg caaagaaaag ttaggaggga a 



51



<210> 193 

<211> 53 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 193 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggcg gctaacaaat tcg



53 



<210> 194 

<211> 49 

<212> DNA 

<213> artificial sequence 



<220> 
<223> Description of Artificial Sequence: primer



<400> 194 

ggggaccact ttgtacaaga aagctgggtg tcgttgttcc ttgcctcac 



49 



<210> 195 

<211> 57 

<212> DNA 

<213> artificial sequence 



<220> 
<223> Description of Artificial Sequence: primer



<400> 195 

ggggacaagt ttgtacaaaa aagcaggctt cacaatgtca ctcttgttcc tcaatcc 



57 



<210> 196 

<211> 54 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 196 

ggggaccact ttgtacaaga aagctgggtc cttctaccga tttctgtttc ttat 54 



<210> 197 
<211> 53 
<212> DNA 



<213> artificial sequence
<220> 

<223> Description of Artificial Sequence: primer 
<400> 197 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggaa ggttcctcgt cag 



<210> 198

<211> 48

<212> DNA

<213> artificial sequence

<220>

<223> Description of Artificial

<400> 198



ggggaccact ttgtacaaga aagctgggtt cacgatgtaa tcatgggc 



48 



<210> 199 
<211> 12 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: motif 
<400> 199 

Glu Glu Thr Ala Arg Phe Gln Pro Gly Tyr Arg Ser
1 5 10



<210> 200

<211> 10

<212> PRT

<213> artificial sequence

<220>

<223> Description of Artificial

<400> 200

Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu



10 



<210> 201

<211> 7

<212> PRT

<213> artificial sequence

<220>

<223> Description of Artificial

<400> 201



Asp Tyr Lys Asp Asp Asp Lys 
<210> 202

<211> 9

<212> PRT

<213> artificial sequence

<220>

<223> Description of Artificial Sequence: motif

<400> 202

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala
1 5


<210> 203

<211> 12

<212> PRT

<213> artificial sequence

<220>

<223> Description of Artificial Sequence: motif

<400> 203

Glu Asp Gln Val Asp Pro Arg Leu Ile Asp Gly Lys
1 5 10


<210> 204

<211> 11

<212> PRT

<213> artificial sequence

<220>

<223> Description of Artificial Sequence: motif

<400> 204

Tyr Thr Asp Ile Glu Met Asn Arg Leu Gly Lys
1 5 10



<210> 205 

<211> 131 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 205 












Asn 


Arg Ile


Leu 


Trp 


Lys 


Gly Val 


1 






5 








Ala 


Asp Val 


Ser 


Val 


Leu 


Gin 


Leu 






20 










Leu 


Glu Glu 


Gin 


Ala 


Leu 


Asp 


Asn 




35 










40 


Leu 


Arg Asp 


Leu 


Ser 


Glu 


Asn 


Glu 




50 








55 




Thr 


Glu Glu 




He 


Lys 


Ser 


Leu 



Asp 


Ala 


Cys 


Pro 


Gly Asp 


Glu 


Asp 




10 










15 




Gin 


Ala 


Glu 


He 


Glu 


Asn 


Leu 


Ala 


25 










30 






Gin 


He 


Arg 


Gin 


Thr 
45 


Glu 


Glu 


Arg 


Lys 


Asn 


Gin 


Lys 
60 


Trp 


Leu 


Phe 


Val 


Pro 


Gly 


Phe 


Gin 


Asn 


Gin 


Thr 


Leu 
65 70 75 80
Ile Ala Val Lys Ala Pro His Gly Thr Thr Leu Glu Val Pro Asp Pro
85 90 95
Asp Glu Ala Ala Asp His Pro Gln Arg Arg Tyr Arg Ile Ile Leu Arg
100 105 110
Ser Thr Met Gly Pro Ile Asp Val Tyr Leu Val Ser Glu Phe Glu Gly
115 120 125
Lys Phe Glu
130



<210> 206 

<211> 385 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 206 



Met 


Ser 


Glu 


Glu 


1 








His 


Pro 


Ser 


Leu 








20 


Tyr 


His 


Arg 


Phe 






35 




Asp 


Gin 


Val 


Val 




50 






Lys 


Thr 


Asp 


Leu 


65 








Gly 


Val 


Leu 


Gin 


Thr 


Ser 


Arg 


Ser 








100 


Ser 


Asn 


Ala 


Gly 






115 




Arg 


Tyr 


Asp 


Ser 




130 






Ile


Lys 


Gin 


Ala 


145 








Thr 


Leu 


Glu 


Val 


Glu 


Gly 


Ile


Gly 








180 


Lys 


Gly 


Leu 


Asp 






195 




Asn 


Leu 


Gin 


Asp 




210 






Asp 


Asp 


Gin 


He 


225 








Asp 


Glu 


Asn 


Asn 


Asn 


Leu 


Pro 


Cys 








260 


His 


Gly 


Thr 


Thr 






275 




Gin 


Arg 


Arg 


Tyr 




290 






Val 


Tyr 


Leu 


Val 



305 



Val 


Pro 


Gin 


Gin 


5 








Ser 


Ser 


Met 


Lys 


Asp 


Ala 


Ala 


Glu 








40 


Ser 


Asp 


Ala 


He 






55 




Val


Asn 


Gln


Ile




70 






Thr 


Pro 


Val 


Ser 


85 








Ala 


Lys 


Ser 


Asn 


Ser 


Pro 


Gly 


Asn 








120 


Ser 


Leu 


Gly 


Leu 






135 




Glu 


Asp 


Gly 


He 




150 






Gin 


Lys 


Arg 


Arg 


165 








Leu 


He 


Glu 


Lys 


Val 


Ser 


Lys 


Pro 








200 


Glu 


Val 


Gin 


Asn 






215 




Arg 


Glu 


Ser 


Gin 




230 






Lys 


Arg 


Leu 


Leu 


245 








Phe 


Gin 


Asn 


Lys 


Leu 


Glu 


Val 


Pro 








280 


Arg 


He 


He 


Leu 






295 




Ser 


Gin 


Phe 


Glu 




310 







Phe Pro Ser Ser 
10 

Pro Pro Leu Val 
25 

Thr Arg Gly Gly 

Val Ile Lys Ser
60 

Val Glu Val Asn 
75 

Gly Lys Gly Gly 
90 

Lys Ser Gly Thr 
105 

Asn Phe Ala Gln

Leu Thr Lys Lys 
140 

Leu Asp Leu Asn 

155 

Ile Tyr Asp Ile
170 

Thr Leu Lys Asn 
185 

Gly Glu Thr Ile

Leu Ala Ala Glu 
220 

Glu Arg Leu Thr 
235 

Phe Val Thr Glu 
250 

Thr Leu Ile Ala
265 

Asp Pro Asp Glu 

Arg Ser Thr Met 
300 

Glu Ser Phe Glu 
315 



Lys Arg Gln Leu
15 

Ala Pro Gly Glu 
30 

Gly Ala Val Ala 
45 

Thr Leu Lys Arg 

Glu Leu Asn Thr 
80 

Lys Ala Lys Lys 
95 

Leu Ala Ser Gly 
110 

Ala Gly Thr Cys 
125 

Phe Ile Asn Leu

Lys Ala Ala Asp 
160 

Thr Asn Val Leu 
175 

Arg Ile Gln Trp
190 

Glu Ser Ile Ala
205 

Glu Ala Arg Leu 

Ser Leu Ser Glu 
240 

Asn Asp Ile Lys
255 

Val Lys Ala Pro 
270 

Ala Gly Gly Tyr 
285 

Gly Pro Ile Asp

Asp Ile Pro Gln
320 
Ala Asp Glu Pro Ser Asn Val Pro

325 

Glu Pro Ser Asn Leu Pro Ser Thr 
340 

Val Ser Met Pro Met Lys Glu Glu 
355 360

Gln Glu Val Asp Asp Thr Gln Arg
370 375 

Asp 
385 



Asp Glu Pro Ser Asn Val Pro Asp 

330 335 
Ser Gly Leu Pro Glu Asn His Asp 
345 350
Ser Thr Glu Arg Asn Met Glu Thr 
365 

Val Tyr Ser Asp Ile Glu Ser His
380 



<210> 207 

<211> 127 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 207 



Met 


Ser 


Glu 


Glu 


Val 


Pro 


Gin 


Gin 


Phe 


Pro 


Ser 


Ser Lys 


Arg 


Gin 


Leu 


1 








5 










10 








15 




His 


Pro 


Ser 


Leu 


Ser 


Ser 


Met 


Lys 


Pro 


Pro 


Leu 


Val Ala 


Pro 


Gly 


Glu 








20 










25 








30 






Tyr 


His 


Arg 


Phe 


Asp 


Ala 


Ala 


Glu 


Thr 


Arg 


Gly 


Gly Gly 


Ala 


Val 


Ala 






35 










40 








45 








Asp 


Gin 


Val 


Val 


Ser 


Asp 


Ala 


He 


Val 


He 


Lys 


Ser Thr 


Leu 


Lys 


Arg 




50 










55 










60 








Lys 


Thr 


Asp 


Leu 


Val 


Asn 


Gin 


He 


Val 


Glu 


Val 


Asn Glu 


Leu 


Asn 


Thr 


65 










70 










75 








80 


Gly 


Val 


Leu 


Gin 


Thr 


Pro 


Val 


Ser 


Gly 


Lys 


Gly 


Gly Lys 


Ala 


Lys 


Lys 










85 










90 








95 




Thr 


Ser 


Arg 


Ser 


Ala 


Lys 


Ser 


Asn 


Lys 


Ser 


Gly 


Thr Leu 


Ala 


Ser 


Gly 








100 










105 








110 






Ser 


Asn 


Ala 


Gly 


Ser 


Pro 


Gly Asn 


Asn 


Phe 


Ala 


Gin Ala 


Gly 


Thr 








115 










120 








125 









<210> 208 

<211> 142 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 208 



Met 


Ser 


Met 


Glu 


Met 


Glu 


Leu 


Phe 


Val 


Thr 


Pro 


Glu 


Lys 


Gin 


Arg 


Gin 


1 








5 










10 










15 




His 


Pro 


Ser 


Val 


Ser 


Val 


Glu 


Lys 


Thr 


Pro 


Val 


Arg Arg 


Lys 


Leu 


He 








20 










25 










30 






Val 


Asp 


Asp 
35 


Asp 


Ser 


Glu 


He 


Gly 
40 


Ser 


Glu 


Lys 


Lys 


Gly 
45 


Gin 


Ser 


Arg 


Thr 


Ser 
50 


Gly 


Gly 


Gly 


Leu 


Arg 
55 


Gin 


Phe 


Ser 


Val 


Met 
60 


Val 


Cys 


Gin 


Lys 


Leu 


Glu 


Ala 


Lys 


Lys 


He 


Thr 


Thr 


Tyr 


Lys 


Glu 


Val 


Ala 


Asp 


Glu 


He 


65 










70 










75 










80 


He 


Ser 


Asp 


Phe 


Ala 
85 


Thr 


He 


Lys 


Gin 


Asn 
90 


Ala 


Glu 


Lys 


Pro 


Leu 
95 


Asn 


Glu 


Asn 


Glu 


Tyr 
100 


Asn 


Glu 


Lys 


Asn 


He 
105 


Arg 


Arg 


Arg 


Val 


Tyr 
110 


Asp 


Ala 


Leu 


Asn 


Val 


Phe 


Met 


Ala 


Leu 


Asp 


lie 


He 


Ala 


Arg 


Asp 


Lys 


Lys 


Glu 
115 120 125

Ile Arg Trp Lys Gly Leu Pro Ile Thr Cys Lys Lys Asp Val
130 135 140



<210> 209 

<211> 101 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 209 



Glu 


Lys 


Lys 


Gly 


Gin 


Ser 


Arg 


Thr 


Ser 


Gly Gly Gly 


Leu 


Arg Gin 


Phe 


1 


5 










10 






15 




Ser 


Val 


Met 


Val 


Cys 


Gin 


Lys 


Leu 


Glu 


Ala 


Lys Lys 


He 


Thr Thr 


Tyr 








20 








25 








30 




Lys 


Glu 


Val 


Ala 


Asp 


Glu 


He 


He 


Ser 


Asp 


Phe Ala 


Thr 


He Lys 


Gin 




35 










40 








45 






Asn 


Ala 


Glu 


Lys 


Pro 


Leu 


Asn 


Glu 


Asn 


Glu 


Tyr Asn 


Glu 


Lys Asn 


He 




50 








55 








60 








Arg 


Arg 


Arg 


Val 


Tyr 


Asp 


Ala 


Leu 


Asn 


Val 


Phe Met 


Ala 


Leu Asp 


He 


65 








70 










75 






80 


He 


Ala 


Arg 


Asp 


Lys 


Lys 


Glu 


He 


Arg 


Trp 


Lys Gly 


Leu 


Pro He 


Thr 






85 










90 






95 




Cys 


Lys 


Lys 


Asp 
100 


Val 





















<210> 210 

<211> 251 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 210 



Glu 


Lys 


Lys 


Gly 


Gin 


Ser 


Arg 


Thr 


Ser 


Gly Gly Gly 


Leu 


Arg 


Gin 


Phe 


1 


5 










10 










15 




Ser 


Val 


Met 


Val 


Cys 


Gin 


Lys 


Leu 


Glu 


Ala 


Lys 


Lys 


He 


Thr 


Thr 


Tyr 








20 








25 










30 






Lys 


Glu 


Val 


Ala 


Asp 


Glu 


He 


He 


Ser 


Asp 


Phe 


Ala 


Thr 


He 


Lys 


Gin 




35 










40 










45 








Asn 


Ala 


Glu 


Lys 


Pro 


Leu 


Asn 


Glu 


Asn 


Glu 


Tyr 


Asn 


Glu 


Lys 


Asn 


He 




50 








55 










60 










Arg 


Arg 


Arg 


Val 


Tyr 


Asp 


Ala 


Leu 


Asn 


Val 


Phe 


Met 


Ala 


Leu 


Asp 


lie 


65 








70 










75 










80 


He 


Ala 


Arg 


Asp 


Lys 


Lys 


Glu 


He 


Arg 


Trp 


Lys 


Gly 


Leu 


Pro 


He 


Thr 






85 










90 










95 




Cys 


Lys 


Lys 


Asp 


Val 


Glu 


Glu 


Val 


Lys 


Met 


Asp Arg 


Asn 


Lys 


Val 


Met 


100 










105 










110 






Ser 


Ser 


Val 


Gin 


Lys 


Lys 


Ala 


Ala 


Phe 


Leu 


Lys 


Glu 


Leu Arg 


Glu 


Lys 






115 








120 










125 








Val 


Ser 
130 


Ser 


Leu 


Glu 


Ser 


Leu 
135 


Met 


Ser 


Arg 


Asn 


Gin 
140 


Glu 


Met 


Val 


Val 


Lys 


Thr 


Gin 


Gly 


Pro 


Ala 


Glu 


Gly 


Phe 


Thr 


Leu 


Pro 


Phe 


He 


Leu 


Leu 


145 








150 










155 










160 


Glu 


Thr 


Asn 


Pro 


His 
165 


Ala 


Val 


Val 


Glu 


He 
170 


Glu 


He 


Ser 


Glu 


Asp 
175 


Met 


Gin 


Leu 


Val 


His 


Leu 


Asp 


Phe 


Asn 


Ser 


Thr 


Pro 


Phe 


Ser 


Val 


His 


Asp 



180 185 190 
Asp Ala Tyr


He Leu 


Lys 


Leu 


Met 


Gin 


Glu 


Gin 


Lys 


Gin 


Glu 


Gin 


Asn 


195 








200 










205 








Arg Val Ser 


Ser Ser 


Ser 


Ser 


Thr 


His 


His 


Gin 


Ser 


Gin 


His 


Ser 


Ser 


210 






215 










220 










Ala His Ser 


Ser Ser 


Ser 


Ser 


Cys 


lie 


Ala 


Ser 


Gly 


Thr 


Ser 


Gly 


Pro 


225 




230 










235 










240


Val Cys Trp 


Asn Ser 


Gly 


Ser 


He 


Asp 


Thr 


Arg 














245 










250 














<210> 211 


























<211> 172 


























<212> PRT 


























<213> Arabidopsis thaliana 




















<400> 211 


























lie lie Ala 


Arg Asp 


Lys 


Lys 


Glu 


He 


Arg 


Trp 


Lys 


Gly 


Leu 


Pro 


He 


1 


5 










10 










15 




Thr Cys Lys 


Lys Asp 


Val


Glu


Glu 


Val 


Lys 


Met 


Asp 


Arg 


Asn 


Lys 


Val 




20 








25 










30 






Met Ser Ser 


Val Gin 


Lys 


Lys 


Ala 


Ala 


Phe 


Leu 


Lys 


Glu 


Leu 


Arg 


Glu 


35 








40 










45 








Lys Val Ser 


Ser Leu 


Glu 


Ser 


Leu 


Met 


Ser 


Arg 


Asn 


Gin 


Glu 


Met 


Val 


50 






55 










60 










Val Lys Thr 


Gin Gly 


Pro 


Ala 


Glu 


Gly 


Phe 


Thr 


Leu 


Pro 


Phe 


lie 


Leu 


65 




70 










75 










80 


Leu Glu Thr 


Asn Pro 


His 


Ala 


Val 


Val 


Glu 


He 


Glu 


He 


Ser 


Glu 


Asp 




85 










90 










95 




Met Gin Leu 


Val His 


Leu 


Asp 


Phe 


Asn 


Ser 


Thr 


Pro 


Phe 


Ser 


Val 


His 




100 








105 










110 






Asp Asp Ala 


Tyr He 


Leu 


Lys 


Leu 


Met 


Gin 


Glu 


Gin 


Lys 


Gin 


Glu 


Gin 


115 








120 










125 








Asn Arg Val 


Ser Ser 


Ser 


Ser 


Ser 


Thr 


His 


His 


Gin 


Ser 


Gin 


His 


Ser 


130 






135 










140 










Ser Ala His 


Ser Ser 


Ser 


Ser 


Ser 


Cys 


lie 


Ala 


Ser 


Gly 


Thr 


Ser 


Gly 


145 




150 










155 










160 


Pro Val Cys 


Trp Asn 


Ser 


Gly 


Ser 


lie 


Asp 


Thr 


Arg 












165 










170 














<210> 212 


























<211> 93 


























<212> PRT 


























<213> Arabidopsis thaliana 




















<400> 212 


























He He Ala 


Arg Asp Lys 


Lys 


Glu 


He 


Arg 


Trp 


Lys 


Gly 


Leu 


Pro 


lie 


1 


5 










10 










15 




Thr Cys Lys 


Lys Asp 


Val 


Glu 


Glu 


Val 


Lys 


Met 


Asp 


Arg 


Asn 


Lys 


Val 




20 








25 










30 






Met Ser Ser 


Val Gin Lys 


Lys 


Ala 


Ala 


Phe 


Leu 


Lys 


Glu 


Leu 


Arg 


Glu 


35 








40 










45 








Lys Val Ser 


Ser Leu 


Glu 


Ser 


Leu 


Met 


Ser 


Arg 


Asn 


Gin 


Glu 


Met 


Val 


50 






55 










60 










Val Lys Thr 


Gin Gly 


Pro 


Ala 


Glu 


Gly 


Phe 


Thr 


Leu 


Pro 


Phe 


lie 


Leu 


65 


70 










75 










80 


Leu Glu Thr 


Asn Pro 


His 


Ala 


Val 


Val 


Glu 


He 


Glu 


He 
<210> 213

<211> 121 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 213 



Ser 


Leu 


Glu 


Ser 


Leu 


Met 


Ser 


Arg 


Asn 


1 

Gin 


Gly 


Pro 


Ala 


5 

Glu 


Gly 


Phe 


Thr 


Leu 






20 










25 


Asn 


Pro 


His 


Ala 


Val 


Val 


Glu 


He 


Glu 






35 










40 




Val 


His 


Leu 


Asp 


Phe 


Asn 


Ser 


Thr 


Pro 




50 










55 






Tyr 


He 


Leu 


Lys 


Leu 


Met 


Gin 


Glu 


Gin 


65 










70 








Ser 


Ser 


Ser 


Ser 


Ser 


Thr 


His 


His 


Gin 










85 










Ser 


Ser 


Ser 


Ser 


Ser 


Cys 


He 


Ala 


Ser 








100 










105 


Trp 


Asn 


Ser 


Gly 


Ser 


lie 


Asp 


Thr 


Arg 




115 










120 





<210> 214 

<211> 193 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 214 














Met 


Thr 


Thr 


Thr 


Gly 
5 


Ser 


Asn 


Ser 


Asn 


1 

Asn 


Asn 


Asn 


Asn 


Asn 


Pro 


Ser 


Thr 


Arg 








20 










25 


Gly 


Gin 


Ser 


Val 


Ser 


Thr 


Ser 


Gly 


Ser 




35 










40 




Ser 


Glu 


Gin 


Thr 


He 


Thr 


Val 


Val 


Thr 




50 










55 






Gin 


Arg 


Leu 


Asn 


Asn 


Leu 


Asp 


He 


Gin 


65 










70 








Gly 


Ala 


Ser 


Gly 


Val 


Lys 


Lys 


Lys 


Lys 










85 










Pro 


Asp 


Lys 


Thr 


Gly 


Arg 


Gly 


Leu 


Arg 






100 










105 


Glu 


Lys 


Val 


Glu 


Ser 


Lys 


Gly Arg 


Thr 




115 










120 




Glu 


Leu 


Val 


Ala 


Glu 


Phe 


Ala 


Leu 


Pro 




130 










135 






Asp 


Gin 


Gin 


Gin 


Tyr 


Asp 


Glu 


Lys 


Asn 


145 










150 








Ala 


Leu 


Asn 


Val 


Leu 


Met 


Ala 


Met 


Asp 










165 










Glu 


He 


Gin 


Trp 


Arg 


Gly 


Leu 


Pro Arg 








180 










185 






90 



Gln Glu Met Val Val Lys Thr
10 15 
Pro Phe Ile Leu Leu Glu Thr
30 

Ile Ser Glu Asp Met Gln Leu
45 

Phe Ser Val His Asp Asp Ala 
60 

Lys Gln Glu Gln Asn Arg Val

75 80 
Ser Gln His Ser Ser Ala His
90 95 
Gly Thr Ser Gly Pro Val Cys 
110 



His Asn His His Glu Ser Asn 
10 15 
Ser Trp Gly Thr Ala Val Ser 
30 

Met Gly Ser Pro Ser Ser Arg 
45 

Ser Thr Ser Asp Thr Thr Phe 
60 

Gly Asp Asp Ala Gly Ser Gln

75 80 
Arg Gly Gln Arg Ala Ala Gly
90 95 
Gln Phe Ser Met Lys Val Cys
110 

Thr Tyr Asn Glu Val Ala Asp 
125 

Asn Asn Asp Gly Thr Ser Pro 
140 

Ile Arg Arg Arg Val Tyr Asp
155 160 
Ile Ile Ser Lys Asp Lys Lys
170 175 
Thr Ser Leu Ser Asp Ile Glu
190 
Glu



<210> 215 








<211> 81 








<212> PRT 








<213> Arabidopsis thaliana 




<400> 215 








Gly Leu Pro Arg 


Thr 


Ser Leu 


Ser 


1 


5 






Arg Leu Ser Leu 


Arg 


Asn Arg 


lie 


20 








Glu Leu Glu Glu 


Gin 


Tyr Val 


Gly 


35 






40 


Glu His Leu Tyr 


Ser 


Ser Gly Asn 


50 




55 




Pro Phe Ile Leu


Val 


Gin Thr 


Arg 


65 




70 




He 









Asp Ile Glu Glu Leu Lys Asn Glu

10 15 
Glu Lys Lys Thr Ala Tyr Ser Gln
25 30
Leu Gln Asn Leu Ile Gln Arg Asn
45

Ala Pro Ser Gly Gly Val Ala Leu 
60 

Pro His Ala Thr Val Glu Val Glu 
75 80 



<210> 216 

<211> 204 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 216 



Gly 


Leu 


Pro 


Arg 


Thr 


Ser 


Leu 


Ser 


Asp 


He 


Glu 


Glu 


Leu 


Lys 


Asn 


Glu 


1 








5 










10 










15 




Arg 


Leu 


Ser 


Leu 


Arg 


Asn 


Arg 


He 


Glu 


Lys 


Lys 


Thr 


Ala 


Tyr 


Ser 


Gin 






20 










25 










30 






Glu 


Leu 


Glu 


Glu 


Gin 


Tyr 


Val 


Gly 


Leu 


Gin 


Asn 


Leu 


He 


Gin Arg Asn 






35 










40 










45 








Glu 


His 


Leu 


Tyr 


Ser 


Ser 


Gly 


Asn 


Ala 


Pro 


Ser 


Gly 


Gly 


Val 


Ala 


Leu 




50 








55 










60 










Pro 


Phe 


He 


Leu 


Val 


Gin 


Thr 


Arg 


Pro 


His 


Ala 


Thr 


Val 


Glu 


Val 


Glu 


65 










70 










75 










80 


He 


Ser 


Glu 


Asp 


Met 


Gin 


Leu 


Val 


His 


Phe 


Asp 


Phe 


Asn 


Ser 


Thr 


Pro 








85 










90 










95 




Phe 


Glu 


Leu 


His 
100 


Asp 


Asp 


Asn 


Phe 


Val 
105 


Leu 


Lys 


Thr 


Met 


Lys 
110 


Phe 


Cys 


Asp 


Gin 


Pro 


Pro 


Gin 


Gin 


Pro 


Asn 


Gly Arg 


Asn 


Asn 


Ser 


Gin 


Leu 


Val 




115 










120 










125 








Cys 


His 


Asn 


Phe 


Thr 


Pro 


Glu 


Asn 


Pro 


Asn 


Lys 


Gly 


Pro 


Ser 


Thr 


Gly 


130 










135 










140 










Pro 


Thr 


Pro 


Gin 


Leu 


Asp 


Met 


Tyr 


Glu 


Thr 


His 


Leu 


Gin 


Ser 


Gin 


Gin 


145 










150 










155 










160 


His 


Gin 


Gin 


His 


Ser 
165 


Gin 


Leu 


Gin 


He 


He 
170 


Pro 


Met 


Pro 


Glu 


Thr 
175 


Asn 


Asn 


Val 


Thr 


Ser 
180 


Ser 


Ala 


Asp 


Thr 


Ala 
185 


Pro 


Val 


Lys 


Ser 


Pro 
190 


Ser 


Leu 


Pro 


Gly 


He 


Met 


Asn 


Ser 


Ser 


Met 


Lys 


Pro 


Glu 


Asn 











195 200 
<210> 217

<211> 420 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 217 



Met 


Ser 


Gly 


Val 


Val 


Arg 


Ser 


Ser 


1 








5 








Pro 


Pro 


Pro 


His 


His 


Pro 


Pro 


Ser 








20 










Pro 


Val 


He 


Pro 


Pro 


He 


Arg 


Arg 






35 










40 


Pro 


Pro 


Phe 


His 


Pro 


Ser 


Asp 


Asp 




50 










55 




Leu 


Ser 


Asn 


Asn 


Asn 


Asp 


Arg 


Ser 


65 










70 






Asp 


Arg


Glu 


Glu 


Asp 


Ala 


Val 


Val 










85 








Lys


Ala 


Thr 


Met 


Asp 


Met 


Val 


Val 








100 










Ser 


Ser 


Gly 


Phe 


Thr 


Asn 


He 


Pro 






115 










120 


Lys


Gly


Gly 


Ara 


Val 


Asn 


He 


Lys 




130 










135 




Thr 


Pro 


Gin 


Thr 


Pro 


He 


Ser 


Thr 


145 










150 






Thr 


Pro 


Ser 


Gly


Ser 


Cys 


Arg 


Tyr 










165 








Lys


Lys


Phe 


Val 


Asn 


Leu 


He 


Lys 








180 










Leu 


Asn 


Lys


Ala 


Ala 


Glu 


Thr 


Leu 






195 










200 


Asp 


He 


Thr 


Asn 


Val 


Leu 


Glu 


Gly 




210 










215 




Lys


Asn 


Arg 


He 


Leu 


Trp 


Lys 


Gly 


225 










230 






Asp 


Ala 


Asp 



Val 


Ser 


Val 


Leu 


Gin 










245 








Ala 


Leu 


Glu 


Glu 


Gin 


Ala 


Leu 


Asp 








260 










Arg 


Leu 


Arg 


Asp 


Leu 


Ser 


Glu 


Asn 






275 










280 


Val 


Thr 


Glu 


Glu 


Asp 


He 


Lys 


Ser 




290 










295 




Leu 


He 


Ala 


Val 


Lys 


Ala 


Pro 


His 


305 










310 






Pro 


Asp 


Glu 


Ala 


Ala 


Asp 


His 


Pro 










325 








Arg 


Ser 


Thr 


Met 


Gly 


Pro 


He 


Asp 








340 










Gly 


Lys 


Phe 


Glu 


Asp 


Thr 


Asn 


Gly 






355 










360 


Leu 


Pro 


He 


Ala 


Ser 


Ser 


Ser 


Gly 




370 










375 




Ala 


Leu 


Thr 


Val 


Asp 


Asn 


Pro 


Glu 


385 










390 







Pro 


Gly 


Ser 


Ser 


Gin 


Pro 


Pro 


Pro 




10 










15 




Ser 


Pro 


Val 


Pro 


Val 


Thr 


Ser 


Thr 


25 










30 






His 


Leu 


Ala 


Phe 


Ala 


Ser 


Thr 


Lys 










45 








Tyr 


His 


Arg 


Phe 


Asn 


Pro 


Ser 


Ser 








60 










Phe 


Val 


His 


Gly 


Cys 


Gly Val 


Val 






75 










80 


Val 


Arg 


Ser 


Pro 


Ser Arg Lys 


Arg 




90 










95 




Ala 


Pro 


Ser 


Asn 


Asn 


Gly 


Phe 


Thr 


105 










110 






Ser 


Ser 


Pro 


Cys 


Gin 


Thr 


Pro 


Arg 










125 








Ser 


Lys



Ala 


Lys 


Gly 


Asn 


Lys


Ser 








140 










Asn 


Ala 


Gly 


Ser 


Pro 


He 


Thr 


Leu 






155 










160 


Asp 


Ser 


Ser 


Leu 


Gly Leu 


Leu 


Thr 




170 










175 




Gin 


Ala 


Lys 


Asp 


Gly 


Met 


Leu 


Asp 


185 










190 






Glu 


Val 


Gin 


Lys


Arg 


Arg 


He 


Tyr 










205 








He 


Asp 


Leu 


lie 


Glu 


Lys 


Pro 


Phe 








220 










Val 


Asp 


Ala 


Cys 


Pro 


Gly 


Asp 


Glu 






235 










240 


Leu 


Gin 


Ala 


Glu 


He 


Glu 


Asn 


Leu 




250 










255 




Asn 


Gin 


He 


Arg 


Gin 


Thr 


Glu 


Glu 


265 










270 






Glu 


Lys


Asn 


Gin 


Lys 


Trp 


Leu 


Phe 










285 








Leu 


Pro 


Gly 


Phe 


Gin 


Asn 


Gin 


Thr 








300 










Gly 


Thr 


Thr 


Leu 


Glu 


Val 


Pro 


Asp 






315 










320 


Gin 


Arg 


Arg 


Tyr 


Arg 


lie 


He 


Leu 




330 










335 




Val 


Tyr 


Leu 


Val 


Ser 


Glu 


Phe 


Glu 


345 










350 






Ser 


Gly 


Ala 


Ala 


Pro 


Pro 


Ala 


Cys 










365 








Ser 


Thr 


Gly 


His 


His 


Asp 


He 


Glu 








380 










Thr 


Ala 


He 


Val 


Ser 


His 


Asp 


His 






395 










400 
Pro His Pro Gln Pro Gly Asp Thr
405 

Gin Val Gly Gly 
420 



<210> 218 

<211> 324 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 218 



Pro 


Ser 


Gly 


Ser 


Cys 


Arg 


Tyr 


Asp 


1 








5 








Lys 


Phe 


Val 


Asn 


Leu 


He 


Lys 


Gin 








20 










Asn 


Lys 


Ala 


Ala 


Glu 


Thr 


Leu 


Glu 




35 










40 


lie 


Thr 


Asn 


Val 


Leu 


Glu 


Gly 


He 




50 










55 




Asn 


Arg 


He 


Leu 


Trp 




Gly 


Val 


65 










70 






Ala 


Asp 


Val 


Ser 


Val 


Leu 


Gin 


Leu 










85 








Leu 


Glu 


Glu 


Gin 


Ala 


Leu 


Asp 


Asn 








100 










Leu 


Arg 


Asp 


Leu 


Ser 


Glu 


Asn 


Glu 






115 










120 


Thr 


Glu 


Glu 


Asp 


lie 


Lys 


Ser 


Leu 




130 










135 




He 


Ala 


Val 


Lys 


Ala 


Pro 


His 


Gly 


145 










150 






Asp 


Glu 


Ala 


Ala 


Asp 


His 


Pro 


Gin 










165 








Ser 


Thr 


Met 


Gly 


Pro 


He 


Asp 


Val 








180 










Lys 


Phe 


Glu 


Asp 


Thr 


Asn 


Gly 


Ser 






195 










200 


Pro 


He 


Ala 


Ser 


Ser 


Ser 


Gly 


Ser 




210 










215 




Leu 


Thr 


Val 


Asp 


Asn 


Pro 


Glu 


Thr 


225 










230 






His 


Pro 


Gin 


Pro 


Gly 


Asp 


Thr 


Ser 










245 








Val 


Gly 


Gly 


Met 


Leu 


Lys 


He 


Thr 








260 










Ser 


Asp 


Tyr 


Trp 


Leu 


Leu 


Ser 


Asn 






275 










280 


Trp 


Lys 


Thr 


Asp 


Ser 


Gly 


He 


Asp 




290 










295 




Ser 


Thr 


Pro 


Pro 


Pro 


Gly 


Met 


Gly 


305 










310 






Ser 


Thr 


Pro 


Arg 











Ser 


Asp 


Leu 


Asn 


Tyr 


Leu 


Gin 


Glu 




410 










415 




Ser 


Ser 


Leu 


Gly 


Leu 


Leu 


Thr 


Lys 




10 










15 




Ala 


Lys 


Asp 


Gly 


Met 


Leu 


Asp 


Leu 


25 










30 






Val 


Gin 


Lys


Arg 


Arg 


He 


Tyr 


Asp 










45 








Asp 


Leu 


He 


Glu 


Lys 


Pro 


Phe 


Lys 








60 










Asp 


Ala 


Cys 


Pro 


Gly 


Asp 


Glu 


Asp 






75 










80 


Gin 


Ala 


Glu 


He 


Glu 


Asn 


Leu 


Ala 




90 










95 




Gin 


He 


Arg 


Gin 


Thr 


Glu 


Glu 


Arg 


105 










110 






Lys 


Asn 


Gin 


Lys 


Trp 


Leu 


Phe 


Val 










125 








Pro 


Gly 


Phe 


Gin 


Asn 


Gin 


Thr 


Leu 








140 










Thr 


Thr 


Leu 


Glu 


Val 


Pro 


Asp 


Pro 






155 










160 


Arg 


Arg 


Tyr 


Arg 


He 


He 


Leu 


Arg 




170 










175 




Tyr 


Leu 


Val 


Ser 


Glu 


Phe 


Glu 


Gly 


185 










190 






Gly 


Ala 


Ala 


Pro 


Pro 


Ala 


Cys 


Leu 










205 








Thr 


Gly 


His 


His 


Asp 


He 


Glu 


Ala 








220 










Ala 


He 


Val 


Ser 


His 


Asp 


His 


Pro 






235 










240 


Asp 


Leu 


Asn 


Tyr 


Leu 


Gin 


Glu 


Gin 




250 










255 




Pro 


Ser 


Asp 


Val 


Glu 


Asn 


Asp 


Glu 


265 










270 






Ala 


Glu 


He 


Ser 


Met 


Thr 


Asp 


He 










285 








Trp 


Asp 


Tyr 


Gly 


He 


Ala 


Asp 


Val 








300 










Glu 


He 


Ala 


Pro 


Thr 


Ala 


Val 


Asp 






315 










320 



<210> 219 
<211> 38
<212> PRT 

<213> Arabidopsis thaliana 
<400> 219 

Met Ser Gly Val Val Arg Ser Ser Pro Gly Ser Ser Gln Pro Pro Pro
1 5 10 15
Pro Pro Pro His His Pro Pro Ser Ser Pro Val Pro Val Thr Ser Thr
20 25 30
Pro Val Ile Pro Pro Ile
35



<210> 220 

<211> 142 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 220 



Met 


Ser 


Met 


Glu 


Met 


Glu 


Leu 


Phe 


Val 


Thr 


Pro 


Glu 


Lys 


Gin 


Arg 


Gin 


1 








5 










10 










15 




His 


Pro 


Ser 


Val 
20 


Ser 


Val 


Glu 


Lys 


Thr 
25 


Pro 


Val 


Arg 


Arg 


Lys 
30 


Leu 


He 


Val 


Asp 


Asp 


Asp 


Ser 


Glu 


He 


Gly 


Ser 


Glu 


Lys 


Lys 


Gly 


Gin 


Ser Arg 






35 










40 










45 








Thr 


Ser 


Gly 


Gly 


Gly Leu Arg 


Gin 


Phe 


Ser 


Val 


Met 


Val 


Cys 


Gin 


Lys 




50 










55 










60 










Leu 


Glu 


Ala 


Lys 


Lys 


He 


Thr 


Thr 


Tyr 


Lys 


Glu 


Val 


Ala 


Asp 


Glu 


He 


65 






70 










75 










80 


He 


Ser 


Asp 


Phe 


Ala 
85 


Thr 


lie 


Lys 


Gin 


Asn 
90 


Ala 


Glu 


Lys 


Pro 


Leu 
95 


Asn 


Glu 


Asn 


Glu 


Tyr 
100 


Asn 


Glu 


Lys 


Asn 


He 
105 


Arg 


Arg 


Arg 


Val 


Tyr 
110 


Asp 


Ala 


Leu 


Asn 


Val 


Phe 


Met 


Ala 


Leu 


Asp 


He 


He 


Ala 


Arg Asp Lys 


Lys 


Glu 






115 










120 










125 








He 


Arg 
130 


Trp 


Lys 


Gly 


Leu 


Pro 
135 


He 


Thr 


Cys 


Lys 


Lys 
140 


Asp 


Val 







<210> 221 

<211> 150 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 221 



Glu 


Glu 


Val 


Lys 


Met 


Asp 


Arg 


Asn 


Lys 


Val 


Met 


Ser 


Ser 


Val 


Gin 


Lys 


1 








5 










10 










15 




Lys 


Ala 


Ala 


Phe 


Leu 


Lys 


Glu 


Leu 


Arg 


Glu 


Lys 


Val 


Ser 


Ser 


Leu 


Glu 






20 










25 










30 






Ser 


Leu 


Met 


Ser 


Arg 


Asn 


Gin 


Glu 


Met 


Val 


Val 


Lys 


Thr Gin Gly 


Pro 






35 










40 










45 








Ala 


Glu 


Gly 


Phe 


Thr 


Leu 


Pro 


Phe 


He 


Leu 


Leu 


Glu 


Thr 


Asn 


Pro 


His 




50 








55 










60 










Ala 


Val 


Val 


Glu 


He 


Glu 


He 


Ser 


Glu Asp 


Met 


Gin 


Leu 


Val 


His 


Leu 


65 










70 










75 










80 


Asp 


Phe 


Asn 


Ser 


Thr 


Pro 


Phe 


Ser 


Val 


His 


Asp 


Asp 


Ala 


Tyr 


He 


Leu 








85 










90 










95 
Lys Leu Met Gln Glu Gln Lys Gln Glu Gln Asn Arg Val Ser Ser Ser

100 105 110

Ser Ser Thr His His Gln Ser Gln His Ser Ser Ala His Ser Ser Ser

115 120 125

Ser Ser Cys Ile Ala Ser Gly Thr Ser Gly Pro Val Cys Trp Asn Ser

130 135 140

Gly Ser Ile Asp Thr Arg
145 150



<210> 222 
<211> 71 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 222

Glu Glu Val Lys Met Asp Arg Asn Lys Val Met Ser Ser Val Gln Lys

1 5 10 15

Lys Ala Ala Phe Leu Lys Glu Leu Arg Glu Lys Val Ser Ser Leu Glu

20 25 30

Ser Leu Met Ser Arg Asn Gln Glu Met Val Val Lys Thr Gln Gly Pro

35 40 45

Ala Glu Gly Phe Thr Leu Pro Phe Ile Leu Leu Glu Thr Asn Pro His

50 55 60

Ala Val Val Glu Ile Glu Ile
65 70



<210> 223 

<211> 262 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 223 



Met 


Thr 


Thr 


Thr 


Gly 


Ser 


Asn 


Ser 


Asn 


His 


Asn 


His 


His 


Glu 


Ser 


Asn 


1 








5 










10 










15 




Asn 


Asn 


Asn 


Asn 
20 


Asn 


Pro 


Ser 


Thr 


Arg 
25 


Ser 


Trp 


Gly 


Thr 


Ala 
30 


Val 


Ser 


Gly 


Gin 


Ser 


Val 


Ser 


Thr 


Ser 


Gly 


Ser 


Met 


Gly 


Ser 


Pro 


Ser 


Ser 


Arg 




35 










40 










45 








Ser 


Glu 
50 


Gin 


Thr 


He 


Thr 


Val 
55 


Val 


Thr 


Ser 


Thr 


Ser 
60 


Asp 


Thr 


Thr 


Phe 


Gin 


Arg 


Leu 


Asn 


Asn 


Leu 


Asp 


He 


Gin 


Gly 


Asp 


Asp 


Ala 


Gly 


Ser 


Gin 


65 








70 










75 










80 


Gly 


Ala 


Ser 


Gly 


Val 


Lys 


Lys 


Lys 


Lys 


Arg 


Gly 


Gin 


Arg 


Ala 


Ala 


Gly 








85 










90 










95 




Pro 


Asp 


Lys 


Thr 


Gly 


Arg 


Gly 


Leu 


Arg 


Gin 


Phe 


Ser 


Met 


Lys 


Val 


Cys 




100 










105 










110 






Glu 


Lys 


Val 


Glu 


Ser 


Lys


Gly 


Arg 


Thr 


Thr 


Tyr 


Asn 


Glu 


Val 


Ala 


Asp 




115 










120 










125 








Glu 


Leu 
130 


Val 


Ala 


Glu 


Phe 


Ala 
135 


Leu 


Pro 


Asn 


Asn 


Asp 
140 


Gly 


Thr 


Ser


Pro 


Asp 


Gin 


Gin 


Gin 


Tyr 


Asp 


Glu 


Lys 


Asn 


He 


Arg 


Arg 


Arg 


Val 


Tyr 


Asp 


145 










150 










155 










160 


Ala 


Leu 


Asn 


Val 


Leu 
165 


Met 


Ala 


Met 


Asp 


He 
170 


He 


Ser 


Lys 


Asp 


Lys 
175 


Lys 


Glu 


He 


Gin 


Trp 


Arg 


Gly 


Leu 


Pro 


Arg 


Thr 


Ser 


Leu 


Ser 


Asp 


He 


Glu 
180










185 










190 






Glu 


Leu 


Lys 
195 


Asn 


Glu 


Arg 


Leu 


Ser 
200 


Leu 


Arg 


Asn 


Arg 


He 
205 


Glu 


Lys 


Lys 


Thr 


Ala 


Tyr 


Ser 


Gin 


Glu 


Leu 


Glu 


Glu 


Gin 


Tyr 


Val 


Gly 


Leu 


Gin 


Asn 




210 








215 










220 










Leu 


He 


Gin 


Arg 


Asn 


Glu 


His 


Leu 


Tyr 


Ser 


Ser 


Gly Asn 


Ala 


Pro 


Ser 


225 










230 










235 










240 


Gly 


Gly 


Val 


Ala 


Leu 


Pro 


Phe 


He 


Leu 


Val 


Gin 


Thr 


Arg 


Pro 


His 


Ala 






245 










250 










255 




Thr 


Val 


Glu 


Val 
260 


Glu 


He 























<210> 224 
<211> 51 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 224 

Gly Val Asp Ala Cys Pro Gly Asp Glu Asp Ala Asp Val Ser Val Leu

1 5 10 15

Gln Leu Gln Ala Glu Ile Glu Asn Leu Ala Leu Glu Glu Gln Ala Leu

20 25 30

Asp Asn Gln Ile Arg Gln Thr Glu Glu Arg Leu Arg Asp Leu Ser Glu

35 40 45

Asn Glu Lys 
50 



<210> 225 

<211> 121 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 225 



Gly 


Val 


Asp 


Ala 


Cys 


Pro 


Gly 


Asp 


Glu 


Asp 


Ala 


Asp 


Val 


Ser 


Val 


Leu 


1 








5 










10 










15 




Gin 


Leu 


Gin 


Ala 

20 


Glu 


lie 


Glu 


Asn 


Leu 
25 


Ala 


Leu 


Glu 


Glu 


Gin 
30 


Ala 


Leu 


Asp 


Asn 


Gin 
35 


He 


Arg 


Gin 


Thr 


Glu 
40 


Glu 


Arg 


Leu 


Arg 


Asp 
45 


Leu 


Ser 


Glu 


Asn 


Glu 
50 


Lys 


Asn 


Gin 


Lys 


Trp 
55 


Leu 


Phe 


Val 


Thr 


Glu 
60 


Glu 


Asp 


He 


Lys 


Ser 


Leu 


Pro 


Gly 


Phe 


Gin 


Asn 


Gin 


Thr 


Leu 


He 


Ala 


Val 


Lys 


Ala 


Pro 


65 








70 










75 










80 


His 


Gly 


Thr 


Thr 


Leu 


Glu 


Val 


Pro 


Asp 


Pro 


Asp 


Glu 


Ala 


Ala 


Asp 


His 








85 










90 










95 




Pro 


Gin 


Arg 


Arg 
100 


Tyr 


Arg 


He 


He 


Leu 
105 


Arg 


Ser 


Thr 


Met 


Gly 
110 


Pro 


He 


Asp 


Val 


Tyr 
115 


Leu 


Val 


Ser 


Glu 


Phe 
120 


Glu 

















<210> 226 

<211> 50 

<212> PRT 

<213> Arabidopsis thaliana 
<400> 226

Gly Leu Asp Val Ser Lys Pro Gly Glu Thr Ile Glu Ser Ile Ala Asn

1 5 10 15

Leu Gln Asp Glu Val Gln Asn Leu Ala Ala Glu Glu Ala Arg Leu Asp

20 25 30

Asp Gln Ile Arg Glu Ser Gln Glu Arg Leu Thr Ser Leu Ser Glu Asp
35 40 45

Glu Asn 
50 



<210> 227 

<211> 118 

<212> PRT 

<213> Arabidopsis thaliana



<400> 227 



Gly 


Leu 


Asp 


Val 


Ser 


Lys 


Pro 


Gly 


Glu 


Thr 


He 


Glu 


Ser 


He 


Ala 


Asn 


1 






5 










10 










15 




Leu 


Gin 


Asp 


Glu 


Val 


Gin 


Asn 


Leu 


Ala 


Ala 


Glu 


Glu 


Ala 


Arg 


Leu 


Asp 






20 










25 










30 






Asp 


Gin 


He 


Arg 


Glu 


Ser 


Gin 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ser 


Glu 


Asp 




35 










40 










45 








Glu 


Asn 


Asn 


Lys 


Arg 


Leu 


Leu 


Phe 


Val 


Thr 


Glu 


Asn Asp 


lie 


Lys 


Asn 




50 








55 










60 










Leu 


Pro 


Cys 


Phe 


Gin 


Asn 


Lys 


Thr 


Leu 


He 


Ala 


Val 


Lys 


Ala 


Pro 


His 


65 








70 










75 










80 


Gly 


Thr 


Thr 


Leu 


Glu 


Val 


Pro 


Asp 


Pro 


Asp 


Glu 


Ala 


Gly Gly Tyr 


Gin 








85 










90 










95 




Arg 


Arg 


Tyr 


Arg 


He 


He 


Leu 


Arg 


Ser 


Thr 


Met 


Gly 


Pro 


He 


Asp 


Val 






100 










105 










110 






Tyr 


Leu 


Val 


Ser 


Gin 


Phe 























115 



<210> 228 

<211> 393 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 228 
aatcgaatac tttggaaggg agttgatgcg tgtcctggcg atgaggatgc tgacgtatct 60 
gtattacagc tgcaggcaga aattgaaaac ctcgccctcg aagagcaagc attagacaac 120 
caaatcagac aaacagagga aagattaaga gacctgagcg aaaatgaaaa gaatcagaaa 180 
tggctttttg taactgaaga ggatatcaag agtttaccag gtttccagaa ccagactctg 240 
atagccgtca aagctcctca tggcacaact ttggaagtgc ctgatccaga tgaagcggct 300 
gaccacccac aaaggagata caggatcatt cttagaagta caatgggacc tattgacgta 360 
tacctcgtca gcgaatttga agggaaattc gaa 393 



<210> 229 

<211> 516 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 229 
attattgcaa gggataaaaa ggaaatccgg tggaaaggac ttcctattac ctgcaaaaag 60 

gatgtggaag aagtcaagat ggatcgtaat aaagttatga gcagtgtgca aaagaaggct 120 

gcttttctta aagagttgag agaaaaggtc tcaagtcttg agagtcttat gtcgagaaat 180 

caagagatgg ttgtgaagac tcaaggccca gcagaaggat ttaccttacc attcattcta 240 

cttgagacaa accctcacgc agtagtcgaa atcgagattt ctgaagatat gcaacttgta 300 

cacctcgact tcaatagcac acctttctcg gtccatgatg atgcttacat tttgaaactg 360 

atgcaagaac agaagcaaga acagaacaga gtatcttctt cttcatctac acatcaccaa 420 

tctcaacata gctccgctca ttcttcatcc agttcttgca ttgcttctgg aacctcaggc 480 

ccggtttgct ggaactcggg atccattgat actcgc 516 



<210> 230 
<211> 276 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 230 

ggtcttcctc ggacaagctt aagcgacatt gaagaattaa agaacgaacg actctcactt 60 
aggaacagaa ttgagaagaa aactgcatat tcccaagaac tggaagaaca agtaatgaac 120 
atcatcgata ctctcggctt atctgcttcc tgccttcaga atctgataca gagaaatgag 180 
cacttatata gctcaggaaa tgctcccagt ggcggtgttg ctcttccttt tatccttgtc 240 
cagactcgtc ctcacgcaac agtagaagtg gagata 276 



<210> 231 

<211> 645 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 231 
ggtcttcctc ggacaagctt aagcgacatt gaagaattaa agaacgaacg actctcactt 60 
aggaacagaa ttgagaagaa aactgcatat tcccaagaac tggaagaaca agtaatgaac 120 
atcatcgata ctctcggctt atctgcttcc tgccttcaga atctgataca gagaaatgag 180 
cacttatata gctcaggaaa tgctcccagt ggcggtgttg ctcttccttt tatccttgtc 240 
cagactcgtc ctcacgcaac agtagaagtg gagatatcag aagatatgca gctcgtgcat 300 
tttgatttca acagcactcc atttgagctc cacgacgaca attttgtcct caagactatg 360 
aagttttgtg atcaaccgcc gcaacaacca aacggtcgga acaacagcca gctggtttgt 420 
cacaatttca cgccagaaaa ccctaacaaa ggccccagca caggtccaac accgcagctg 480 
gatatgtacg agactcatct tcaatcgcaa caacatcagc agcattctca gctacaaatc 540 
attcctatgc ctgagactaa caacgttact tccagcgctg atactgctcc agtgaaatcc 600 
ccgtctcttc cagggataat gaactccagc atgaagccgg agaat 645 



<210> 232 

<211> 450 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 232 
gaagaagtca agatggatcg taataaagtt atgagcagtg tgcaaaagaa ggctgctttt 60 
cttaaagagt tgagagaaaa ggtctcaagt cttgagagtc ttatgtcgag aaatcaagag 120 
atggttgtga agactcaagg cccagcagaa ggatttacct taccattcat tctacttgag 180 
acaaaccctc acgcagtagt cgaaatcgag atttctgaag atatgcaact tgtacacctc 240 
gacttcaata gcacaccttt ctcggtccat gatgatgctt acattttgaa actgatgcaa 300 
gaacagaagc aagaacagaa cagagtatct tcttcttcat ctacacatca ccaatctcaa 360 
catagctccg ctcattcttc atccagttct tgcattgctt ctggaacctc aggcccggtt 420 
tgctggaact cgggatccat tgatactcgc 450 



<210> 233 

<211> 213 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 233 

gaagaagtca agatggatcg taataaagtt atgagcagtg tgcaaaagaa ggctgctttt 60 
cttaaagagt tgagagaaaa ggtctcaagt cttgagagtc ttatgtcgag aaatcaagag 120 
atggttgtga agactcaagg cccagcagaa ggatttacct taccattcat tctacttgag 180 
acaaaccctc acgcagtagt cgaaatcgag att 213 



<210> 234 

<211> 870 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 234 
atgacaacta ctgggtctaa ttctaatcac aaccaccatg aaagcaataa taacaacaat 60 
aaccctagta ctaggtcttg gggcacggcg gtttcaggtc aatctgtgtc tactagcggc 120 
agtatgggct ctccgtcgag ccggagtgag caaaccatca ccgttgttac atctactagc 180 
gacactactt ttcaacgcct gaataatttg gacattcaag gtgatgatgc tggttctcaa 240 
ggagcttctg gtgttaagaa gaagaagagg ggacagcgtg cggctggtcc agataagact 300 
ggaagaggac tacgtcaatt tagtatgaaa ggtcttatct ctttctctgc ccctattatg 360 
ctttcatcta aatgcctttc aatttgtgaa aaggtggaaa gcaaaggaag gacaacttac 420 
aatgaggttg cagacgagct tgttgctgaa tttgcacttc caaataacga tggaacatcc 480 
cctgatcagc aacagtatga tgagaaaaac ataagacgaa gagtatatga tgctttaaac 540 
gtcctcatgg ctatggatat aatatccaag gataaaaaag aaattcaatg gagaggtctt 600 
cctcggacaa gcttaagcga cattgaagaa ttaaagaacg aacgactctc acttaggaac 660 
agaattgaga agaaaactgc atattcccaa gaactggaag aacaagtaat gaacatcatc 720 
gatactctcg gcttatctgc ttcctgcctt cagaatctga tacagagaaa tgagcactta 780 
tatagctcag gaaatgctcc cagtggcggt gttgctcttc cttttatcct tgtccagact 840 
cgtcctcacg caacagtaga agtggagata 870 



<210> 235 
<211> 153 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 235 

ggagttgatg cgtgtcctgg cgatgaggat gctgacgtat ctgtattaca gctgcaggca 60 
gaaattgaaa acctcgccct cgaagagcaa gcattagaca accaaatcag acaaacagag 120 
gaaagattaa gagacctgag cgaaaatgaa aag 153 



<210> 236 

<211> 363 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 236 

ggagttgatg cgtgtcctgg cgatgaggat gctgacgtat ctgtattaca gctgcaggca 60 
gaaattgaaa acctcgccct cgaagagcaa gcattagaca accaaatcag acaaacagag 120 
gaaagattaa gagacctgag cgaaaatgaa aagaatcaga aatggctttt tgtaactgaa 180 
gaggatatca agagtttacc aggtttccag aaccagactc tgatagccgt caaagctcct 240 
catggcacaa ctttggaagt gcctgatcca gatgaagcgg ctgaccaccc acaaaggaga 300 
tacaggatca ttcttagaag tacaatggga cctattgacg tatacctcgt cagcgaattt 360 



<210> 237 
<211> 150 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 237 

ggtctcgatg tctcaaaacc aggagaaaca atcgaaagca tagctaacct acaggatgaa 60 
gtacaaaacc tcgcagctga ggaggcaaga ttagatgacc aaatcagaga atcacaagaa 120 
agattaacaa gcttgagtga ggatgaaaac 150 



<210> 238 

<211> 354 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 238 
ggtctcgatg tctcaaaacc aggagaaaca atcgaaagca tagctaacct acaggatgaa 60 
gtacaaaacc tcgcagctga ggaggcaaga ttagatgacc aaatcagaga atcacaagaa 120 
agattaacaa gcttgagtga ggatgaaaac aacaaaaggt tactgttcgt cactgaaaac 180 
gacattaaga acctaccatg cttccagaat aagacgctga tagctgtaaa ggcaccgcat 240 
ggaacaactc ttgaggttcc agatcctgat gaggctggtg gttatcagag gaggtacaga 300 
atcattctga gaagcacaat gggaccaata gacgtgtacc tagtcagtca attc 354 



<210> 239 

<211> 426 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 239 
atgagtatgg agatggagtt gtttgtcact ccagagaagc agaggcaaca tccttcagtg 60 
agcgttgaga aaactccagt gagaaggaaa ttgattgttg atgatgattc tgaaattgga 120 
tcagagaaga aagggcaatc aagaacttct ggaggcgggc ttcgtcaatt cagtgttatg 180 
gtttgtcaga agttggaagc caagaagata actacttaca aggaggttgc agacgaaatt 240 
atttcagatt ttgccacaat taagcaaaac gcagagaagc ctttgaatga aaatgagtac 300 
aatgagaaga acataaggcg gagagtctac gatgcgctca atgtgttcat ggcgttggat 360 
attattgcaa gggataaaaa ggaaatccgg tggaaaggac ttcctattac ctgcaaaaag 420 
gatgtg 426 



<210> 240 

<211> 7 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 240 

Met Lys Val Cys Glu Lys Val 
1 5 


<210> 241 
<211> 8 
<212> PRT 
<213> artificial sequence 
<220> 
<223> Description of Artificial 
<400> 241 

Leu Asn Val Leu Met Ala Met Asp 
1 5 



<210> 242 
<211> 8 
<212> PRT 
<213> artificial sequence 
<220> 
<223> Description of Artificial 
<400> 242 

Phe Asn Ser Thr Pro Phe Glu Leu 
1 5 



<210> 243 
<211> 30 
<212> DNA 
<213> artificial sequence 
<220> 
<223> Description of Artificial Sequence: primer 
<400> 243 

atagaattca tgaaagtttg tgaaaaggtg 30 



<210> 244 
<211> 33 
<212> DNA 
<213> artificial sequence 
<220> 
<223> Description of Artificial 
<400> 244 

atagaattcc tgaatgttct catggcaatg gat 33 



<210> 245 

<211> 33 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 245 

ataggatccc agctcaaaag gagtgctatt gaa 



<210> 246 

<211> 29 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 246 

ggggacaagt ttgtacaaaa aagcaggct 



<210> 247 

<211> 5 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 247 
tcaca 



<210> 248 

<211> 29 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: motif 
<400> 248 

ggggaccact ttgtacaaga aagctgggt 



<210> 249 

<211> 27 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 249 

atagaattca tgtccggtgt cgtacga 27 



<210> 250 
<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 250 

ataggatccc acctccaatg tttctgcagc 



<210> 251 

<211> 30 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 



<400> 251 

atagaattcg agaagaaagg gcaatcaaga 30 

<210> 252 

<211> 30 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 252 

atactgcaga gaaatctcga tttcgactac 



<210> 253 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 253 

gccactctca tagggttctc catcg 



25 



<210> 254 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 254 

ggcatgcctc caagatcctt gaagt 



25 



<210> 255 

<211> 22 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 255 

gggtcttggt cgttttactg tt 



22 



<210> 256 
<211> 25 
<212> DNA 
<213> artificial sequence 
<220> 
<223> Description of Artificial 
<400> 256 

ccaagacgat gacaacagat acagc 25 



<210> 257 
<211> 21 
<212> DNA 
<213> artificial sequence 
<220> 
<223> Description of Artificial 
<400> 257 

ataaactaaa tcttcgctga a 21 



<210> 258 
<211> 21 
<212> DNA 
<213> artificial sequence 
<220> 
<223> Description of Artificial 
<400> 258 

caaacgcgga tctgaaaaac t 21 



<210> 259 
<211> 18 
<212> DNA 
<213> artificial sequence 
<220> 
<223> Description of Artificial Sequence: primer 
<400> 259 

tctctcttcc aaatctcc 18 



<210> 260 

<211> 20 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 260 

aagtctctca ctttctcact 



20 



<210> 261 

<211> 25 

<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 261 

ctaagctctc aagatcaaag gctta 



<210> 262 
<211> 25 
<212> DNA 
<213> artificial sequence 
<220> 
<223> Description of Artificial 
<400> 262 

ttaacattgc aaagagtttc aaggt 25 



<210> 263 

<211> 4 

<212> PRT 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: motif 



<400> 263 

Thr Pro Trp Lys 

1 



<210> 264 
<211> 289 
<212> PRT 
<213> Arabidopsis thaliana 

<400> 264 

Met Gly Lys Tyr Ile Arg Lys Ser Lys Ile Asp Gly Ala Gly Ala Gly 
1 5 10 15 

Ala Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Glu Ser Ser Ile Ala 
20 25 30 

Leu Met Asp Val Val Ser Pro Ser Ser Ser Ser Ser Leu Gly Val Leu 
35 40 45 

Thr Arg Ala Lys Ser Leu Ala Leu Gln Gln Gln Gln Gln Arg Cys Leu 
50 55 60 

Leu Gln Lys Pro Ser Ser Pro Ser Ser Leu Pro Pro Thr Ser Ala Ser 
65 70 75 80 

Pro Asn Pro Pro Ser Lys Gln Lys Met Lys Lys Lys Gln Gln Gln Met 
85 90 95 

Asn Asp Cys Gly Ser Tyr Leu Gln Leu Arg Ser Arg Arg Leu Gln Lys 
100 105 110 

Lys Pro Pro Ile Val Val Ile Arg Ser Thr Lys Arg Arg Lys Gln Gln 
115 120 125 

Arg Arg Asn Glu Thr Cys Gly Arg Asn Pro Asn Pro Arg Ser Asn Leu 
130 135 140 

Asp Ser Ile Arg Gly Asp Gly Ser Arg Ser Asp Ser Val Ser Glu Ser 
145 150 155 160 

Val Val Phe Gly Lys Asp Lys Asp Leu Ile Ser Glu Ile Asn Lys Asp 
165 170 175 

Pro Thr Phe Gly Gln Asn Phe Phe Asp Leu Glu Glu Glu His Thr Gln 
180 185 190 

Ser Phe Asn Arg Thr Thr Arg Glu Ser Thr Pro Cys Ser Leu Ile Arg 
195 200 205 

Arg Pro Glu Ile Met Thr Thr Pro Gly Ser Ser Thr Lys Leu Asn Ile 
210 215 220 

Cys Val Ser Glu Ser Asn Gln Arg Glu Asp Ser Leu Ser Arg Ser His 
225 230 235 240 

Arg Arg Arg Pro Thr Thr Pro Glu Met Asp Glu Phe Phe Ser Gly Ala 
245 250 255 

Glu Glu Glu Gln Gln Lys Gln Phe Ile Glu Lys Tyr Asn Phe Asp Pro 
260 265 270 

Val Asn Glu Gln Pro Leu Pro Gly Arg Phe Glu Trp Thr Lys Val Asp 
275 280 285 

Asp 



<210> 265 



<211> 20 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 265 

cgggccccaa ataatgattt 



20 



<210> 266 
<211> 18 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 266 
gacacgggcc agagctgc 



<210> 267 

<211> 9 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 2, 3, 5 and 6 

<223> Xaa = any amino acid 

<220> 

<221> VARIANT 

<222> 7 

<223> Xaa = Ile or Val 



<220> 

<221> VARIANT 

<222> 8 

<223> Xaa = any amino acid or a stretch of any two 
subsequent amino acids 

<220> 

<223> Description of Artificial Sequence: motif 



<400> 267 

Arg Xaa Xaa Leu Xaa Xaa Xaa Xaa Asn 
1 5 



<210> 268 
<211> 8 
<212> PRT 
<213> artificial sequence 

<220> 
<221> VARIANT 
<222> 3 
<223> Xaa = any amino acid 

<220> 
<221> VARIANT 
<222> 6 
<223> Xaa = Ile or Val 

<220> 
<223> Description of Artificial 

<400> 268 

Met Arg Xaa Ile Leu Xaa Asp Trp 
1 5 



<210> 269 
<211> 8 
<212> PRT 
<213> artificial sequence 

<220> 
<221> VARIANT 
<222> 5,6,7 
<223> Xaa = any amino acid 

<220> 
<223> Description of Artificial Sequence: motif 

<400> 269 

Lys Tyr Glu Glu Xaa Xaa Xaa Pro 
1 5 



<210> 270 
<211> 9 
<212> PRT 
<213> artificial sequence 

<220> 
<221> VARIANT 
<222> 2,4,5,7 
<223> Xaa = any amino acid 

<220> 
<223> Description of Artificial 

<400> 270 

Gly Xaa Gly Xaa Xaa Gly Xaa Val 
1 



<210> 271 

<211> 10 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 4,6,7,9 

<223> Xaa = any amino acid 



<220> 

<223> Description of Artificial Sequence: motif 



<400> 271 

His Arg Asp Xaa Lys Xaa Xaa Asn Xaa Leu 
1 5 10 



<210> 272 

<211> 11 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 2 

<223> Xaa = any amino acid or a stretch of any two 
subsequent amino acids 



<220> 

<221> VARIANT 

<222> 5,7,8,9,10 

<223> Xaa = any amino acid 



<220> 

<221> VARIANT 

<222> 3 

<223> Xaa = Trp or Tyr 
<220> 

<223> Description of Artificial Sequence: motif 



<400> 272 

Asp Xaa Xaa Ser Xaa Gly Xaa Xaa Xaa Xaa Glu 
1 5 10 



<210> 273 

<211> 4 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 3 

<223> Xaa = amino acid or a stretch of any two 
subsequent amino acids 

<220> 

<221> VARIANT 

<222> 4 

<223> Xaa = Arg or Lys 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 273 

Thr Pro Xaa Xaa 

1 



<210> 274 
<211> 4 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 3 

<223> Xaa = any amino acid 
<220> 

<221> VARIANT 

<222> 4 

<223> Xaa = Arg or Lys 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 274 

Ser Pro Xaa Xaa 

1 



<210> 275 

<211> 4 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 3 

<223> Xaa = any amino acid 



<220> 

<221> VARIANT 

<222> 4 

<223> Xaa = Ile, Leu, Val or Met 



<220> 

<223> Description of Artificial Sequence: motif 



<400> 275 

Ser Pro Xaa Xaa 

1 



<210> 276 

<211> 4 

<212> PRT 

<213> artificial sequence 

<220> 

<221> VARIANT 

<222> 3 

<223> Xaa = Ile, Leu, Val or Met 
<220> 

<221> VARIANT 

<222> 4 

<223> Xaa = any amino acid 
<220> 

<223> Description of Artificial Sequence: motif 



<400> 276 

Ser Pro Xaa Xaa 

1 



<210> 277 

<211> 7 

<212> PRT 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence: motif 



<400> 277 

Pro Lys Lys Lys Arg Lys Val 
1 5 



<210> 278 

<211> 7 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 3 

<223> Xaa may be a stretch of any ten subsequent amino 
acids 

<220> 

<223> Description of Artificial Sequence: motif 

<400> 278 

Lys Arg Xaa Lys Lys Lys Lys 
1 5 



<210> 279 

<211> 5 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 279 

Lys Arg Pro Arg Pro 

1 5 



<210> 280 
<211> 9 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 280 

Pro Ala Ala Lys Arg Val Lys Leu Asp 
1 5 



<210> 281 

<211> 4 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 2,3 

<223> Xaa = any amino acid 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 281 

Arg Xaa Xaa Phe 

1 



<210> 282 

<211> 5 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 2,4 

<223> Xaa = any amino acid 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 282 

Leu Xaa Cys Xaa Glu 

1 5 



<210> 283 

<211> 5 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 2,4 

<223> Xaa = any amino acid 
<220> 

<223> Description of Artificial Sequence: motif 

<400> 283 

Leu Xaa Ser Xaa Glu 

1 5 



<210> 284 
<211> 9 
<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 
<222> 3 

<223> Xaa may be a stretch of any seven subsequent amino 
acids 

<220> 

<221> VARIANT 
<222> 5 

<223> Xaa may be a stretch of any three subsequent amino 
acids 

<220> 

<223> Description of Artificial Sequence: motif 
<400> 284 

Asp Tyr Xaa Glu Xaa Asp Leu Phe Asp 
1 5 



<210> 285 

<211> 9 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 3 

<223> Xaa may be a stretch of any six subsequent amino 
acids 

<220> 

<221> VARIANT 

<222> 5 

<223> Xaa may be a stretch of any four subsequent amino 
acids 

<220> 

<223> Description of Artificial Sequence: motif 



<400> 285 

Asp Tyr Xaa Asp Xaa Asp Met Trp Glu 
1 5 



<210> 286 

<211> 35 

<212> PRT 

<213> artificial sequence 



<220> 

<221> VARIANT 

<222> 1 

<223> Xaa may be Asp, Asn or no amino acid 



<220> 

<221> VARIANT 

<222> 2 

<223> Xaa = Gln or Glu 



<220> 

<221> VARIANT 

<222> 7 

<223> Xaa = Arg or Gly 
<220> 

<221> VARIANT 

<222> 10 

<223> Xaa = Tyr or Asp 



<220> 

<221> VARIANT 

<222> 16 

<223> Xaa = Leu or Phe 



<220> 

<221> VARIANT 

<222> 19 

<223> Xaa = Met, Ile, Leu or no amino acid 



<220> 

<221> VARIANT 

<222> 20 

<223> Xaa = Asp or Asn 
<220> 

<221> VARIANT 

<222> 21 

<223> Xaa = Val or Ile 



<220> 

<221> VARIANT 

<222> 23 

<223> Xaa = Ser or Ala 



<220> 

<221> VARIANT 

<222> 24 

<223> Xaa = Lys or Arg 
<220> 

<221> VARIANT 

<222> 25 

<223> Xaa = Asp or Glu 
<220> 

<221> VARIANT 

<222> 30 

<223> Xaa = Lys, Gln, Arg or no amino acid 



<220> 

<221> VARIANT 
<222> 32 

<223> Xaa = Arg, Lys or Ile 



<220> 

<223> Description of Artificial Sequence: motif 



<400> 286 

Xaa Xaa Lys Asn Ile Arg Xaa Arg Val Xaa Asp Ala Leu Asn Val Xaa 
1 5 10 15 

Met Ala Xaa Xaa Xaa Ile Xaa Xaa Xaa Lys Lys Glu Ile Xaa Trp Xaa 
20 25 30 

Gly Leu Pro 
35 



<210> 287 

<211> 37 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 1 

<223> Xaa = Gly or Asn 
<220> 

<221> VARIANT 

<222> 2 

<223> Xaa = Lys or Arg 
<220> 

<221> VARIANT 

<222> 6 

<223> Xaa = His or Gln 



<220> 

<221> VARIANT 

<222> 9 

<223> Xaa = Met or Val 



<220> 

<221> VARIANT 

<222> 10 

<223> Xaa = Lys or Met 
<220> 
<221> VARIANT 
<222> 11 
<223> Xaa = Ile or Val 

<220> 
<221> VARIANT 
<222> 12 
<223> Xaa may be a stretch of any zero to seventeen 
subsequent amino acids 

<220> 
<221> VARIANT 
<222> 14 
<223> Xaa = Glu or Gln 

<220> 
<221> VARIANT 
<222> 16 
<223> Xaa = Val or Leu 

<220> 
<221> VARIANT 
<222> 17 
<223> Xaa = Gln, Glu or no amino acid 

<220> 
<221> VARIANT 
<222> 18 
<223> Xaa = Ser or no amino acid 

<220> 
<221> VARIANT 
<222> 19 
<223> Xaa = any amino acid 

<220> 
<221> VARIANT 
<222> 21 
<223> Xaa = Gly or Lys 

<220> 
<221> VARIANT 
<222> 22 
<223> Xaa = Arg, Ile or no amino acid 

<220> 
<221> VARIANT 
<222> 25 
<223> Xaa = Ser or no amino acid 

<220> 
<221> VARIANT 
<222> 27 
<223> Xaa = Asn or Lys 

<220> 
<221> VARIANT 
<222> 33 
<223> Xaa = Leu or Ile 

<220> 
<221> VARIANT 
<222> 34 
<223> Xaa = Val or Ile 

<220> 
<221> VARIANT 
<222> 35 
<223> Xaa = Ala or Ser 

<220> 
<221> VARIANT 
<222> 36 
<223> Xaa = Glu or Gln 
<220> 

<223> Description of Artificial Sequence: motif 
<400> 287 

Xaa Xaa Gly Leu Arg Xaa Phe Ser Xaa Xaa Xaa Xaa Cys Xaa Lys Xaa 

1 5 10 15 

Xaa Xaa Xaa Lys Xaa Xaa Thr Thr Xaa Tyr Xaa Glu Val Ala Asp Glu 

20 25 30 

Xaa Xaa Xaa Xaa Phe 

35 



<210> 288 
<211> 9 
<212> PRT 
<213> artificial sequence 

<220> 
<221> VARIANT 
<222> 1 
<223> Xaa = Arg or Ser 

<220> 
<221> VARIANT 
<222> 2 
<223> Xaa = Ile or Val 

<220> 
<221> VARIANT 
<222> 3,6,8 
<223> Xaa = any amino acid 

<220> 
<221> VARIANT 
<222> 4 
<223> Xaa = Gln or Lys 

<220> 
<221> VARIANT 
<222> 7 
<223> Xaa = Leu or Ser 

<220> 
<223> Description of Artificial Sequence: motif 

<400> 288 

Xaa Xaa Xaa Xaa Lys Xaa Xaa Xaa Glu 
1 5 



<210> 289 

<211> 19 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 1 

<223> Xaa = Arg or Ser 
<220> 

<221> VARIANT 

<222> 2 

<223> Xaa = Ile or Val 



<220> 

<221> VARIANT 

<222> 3,5,8 

<223> Xaa = any amino acid 
<220> 

<221> VARIANT 

<222> 4 

<223> Xaa = Gln or Lys 
<220> 

<221> VARIANT 

<222> 6 

<223> Xaa may be a stretch of any three subsequent amino 
acids 



<220> 

<221> VARIANT 

<222> 7 

<223> Xaa = Leu or Ser 



<220> 

<221> VARIANT 

<222> 10 

<223> Xaa = Leu or Met 



<220> 

<221> VARIANT 

<222> 11 

<223> Xaa may be a stretch of any of two or three 
subsequent amino acids 

<220> 

<221> VARIANT 
<222> 12 

<223> Xaa = Gln or His 



<220> 

<221> VARIANT 

<222> 13 

<223> Xaa may be a stretch of any of four or five 
subsequent amino acids 

<220> 

<221> VARIANT 

<222> 16 

<223> Xaa = Val, Ile or Met 



<220> 

<221> VARIANT 

<222> 17 

<223> Xaa = Gln or Glu 



<220> 

<223> Description of Artificial Sequence: motif 
<400> 289 

Xaa Xaa Xaa Xaa Lys Xaa Xaa Xaa Glu Xaa Xaa Xaa Xaa Asn Leu Xaa 
1 5 10 15 

Xaa Arg Asn 



<210> 290 

<211> 26 

<212> PRT 

<213> artificial sequence 
<220> 

<221> VARIANT 

<222> 1,5 

<223> Xaa = Leu or Ile 
<220> 

<221> VARIANT 

<222> 6 

<223> Xaa = Val or Leu 
<220> 

<221> VARIANT 

<222> 7,25 

<223> Xaa = amino acid 
<220> 

<221> VARIANT 

<222> 9 

<223> Xaa may be a stretch of any of three or four 
subsequent amino acids 

<220> 
<221> VARIANT 
<222> 10 
<223> Xaa = Thr or Val 

<220> 
<221> VARIANT 
<222> 12 
<223> Xaa may be a stretch of any of twelve to fourteen 
subsequent amino acids 

<220> 
<221> VARIANT 
<222> 14 
<223> Xaa may be a stretch of any of three or four 
subsequent amino acids 

<220> 
<221> VARIANT 
<222> 16 
<223> Xaa = Glu or Ser 



<220> 

<221> VARIANT 

<222> 17 

<223> Xaa = Met, Ile, Val or Leu 



<220> 

<221> VARIANT 
<222> 21 

<223> Xaa may be a stretch of any of two subsequent amino 
acids 



<220> 

<221> VARIANT 

<222> 22 

<223> Xaa = Val or Ile 



<220> 

<221> VARIANT 

<222> 24 

<223> Xaa = Arg or Lys 



<220> 

<223> Description of Artificial Sequence: motif 



<400> 290 

Xaa Pro Phe Ile Xaa Xaa Xaa Thr Xaa Xaa Val Xaa Phe Xaa Phe Xaa 
1 5 10 15 

Xaa His Asp Asp Xaa Xaa Leu Xaa Xaa Met 
20 25 
2. Claims Nos.: -W* 24, 30-32, 35, 39-45 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 

an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of Invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 



1. As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
searchable claims. 

2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3. As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
covers only those claims for which fees were paid, specifically claims Nos.: 

4. No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-45 partially 

A nucleic acid comprising SEQ ID NO: 3 or 41 and a 
polypeptide encoded by said nucleic acid comprising SEQ ID 
NO: 69 (CCP3). A vector, a cell, a host cell, a method of 
producing a polypeptide, an antibody, a method of detecting 
the presence, a kit, a method for identifying a compound, a 
method for modulating the activity, a transgenic plant, a 
method for modulating the growth of a plant, a method for 
modulating the cell cycle in a plant comprising said nucleic 
acid or polypeptide. 



2. Claims: 1-45 partially 

same as invention 1 but comprising SEQ ID NO: 6, 42, 72 and 
108 (CCP6) . 



3. Claims: 1-45 partially 

same as invention 1 but comprising SEQ ID NO: 12, 13, 45, 
78, 79 and 111 (CCP12/13) . 



4. Claims: 1-45 partially 

same as invention 1 but comprising SEQ ID NO: 29 and 95 
(CCP29) . 



5. Claims: 30-45 partially 

A method for modulating the growth of a plant, a method for 
modulating the cell cycle in a plant comprising a CCP 
modulator with SEQ ID NO: 1, 39, 67 and 105 (CCP1). 



6. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 2, 40, 68 and 
106 (CCP2) . 



7. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 4 and 70 
(CCP4) . 



8. Claims: 30-45 partially 
same as invention 5 but comprising SEQ ID NO: 5 and 71 
(CCP5). 

9. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 7, 8, 42, 43, 
72, 73, 108 and 109 (CCP7/8). 

10. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 9 and 75 

11. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 10 and 76 

12. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 11, 44, 77 and 

13. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 14, 46, 80 and 
112 (CCP14). 

14. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 15, 47, 81 and 
113 (CCP15). 

15. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 16, 48, 82 and 
114 (CCP16). 

16. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 17 and 83 
(CCP17). 

17. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 18, 49, 84 and 
115 (CCP18). 








18. 


Claims: 30-45 partially 










same as invention 5 but comprising SEQ ID NO: 
(CCP19) . 


19 and 85 




19. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 20, 21, 50,
86, 87 and 116 (CCP20/21).


20. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 22, 51, [...]
117 (CCP22).


21. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 23, 52, 89 and
118 (CCP23).


22. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 24, 53, [...] and
119 (CCP24).


23. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 25, 54, 91 and
120 (CCP25).


24. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 26, 55, 92 and
121 (CCP26).


25. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 27, 56, 93 and
122 (CCP27).


26. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 28, 57, 94 and
123 (CCP28).

27. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 30, 58, 96 and
124 (CCP30).

28. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 31, 59, 97 and
125 (CCP31).

29. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 32, 60, 98 and 
126 (CCP32).

30. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 33, 61, 99 and 
127 (CCP33).

31. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 34, 62, 100
and 128 (CCP34).

32. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 35, 63, 101 
and 129 (CCP35).

33. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 36, 64, 102 
and 130 (CCP36).

34. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 37, 65, 103 
and 131 (CCP37).

35. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 38, 66, 104 
and 132 (CCP38).

36. Claims: 30-45 partially

same as invention 5 but comprising SEQ ID NO: 205, 224, 225,
228, 235 and 236 (AtE2Fa).

37. Claims: 30-45 partially 

same as invention 5 but comprising SEQ ID NO: 211, 220-222,
229, 232, 233 and 239 (AtDPa).

38. Claims: 30-45 partially 

same as invention 1 but comprising SEQ ID NO: 215, 216, 223,
230, 231 and 234 (AtDPb).

39. Claims: 30-45 partially 

same as invention 1 but comprising SEQ ID NO: 226, 227, 237
and 238 (AtE2Fb).

Continuation of Box I.2

Claims Nos.: 18, 24, 30-32, 35, 39-45



Present claims 18, 24, 30-32, 35, 39-45 relate to a product/compound
defined by reference to a desirable characteristic or property, namely a
compound that binds to a polypeptide or a CCP modulator. The claims cover
all products/compounds having this characteristic or property, whereas
the application provides support within the meaning of Article 6 PCT
and/or disclosure within the meaning of Article 5 PCT for only a very
limited number of such products/compounds. In the present case, the
claims so lack support, and the application so lacks disclosure, that a
meaningful search over the whole of the claimed scope is impossible.
Independent of the above reasoning, the claims also lack clarity (Article
6 PCT). An attempt is made to define the product/compound by reference to
a result to be achieved. Again, this lack of clarity in the present case
is such as to render a meaningful search over the whole of the claimed
scope impossible. Consequently, the search has been carried out for those
parts of the claims which appear to be clear, supported and disclosed,
namely those parts relating to the products/compounds antibodies,
mutated variants of the polypeptide, antisense nucleic acid and ribozyme
mentioned in the description at pages 38-43, 47-52.

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 